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METHODS AND MICROORGANISMS FOR 
PRODUCTION OF PANTO-COMPOUNDS 

Background of the Invention • 

5 Pantothenate, also known as pantothenic acid or vitamin B5, is a member of the 

B complex of vitamins and is a nutritional requirement for mamihals, including 
livestock and humans {e.g., from food sources, as a water soluble vitamin supplement or 
as a feed additive). In cells, pantothenate is used primarily for the biosynthesis of 
coenzyme A (CoA) and acyl carrier protein (ACP). These coenzymes function in the 
10 metabolism of acyl moieties which form thioesters with the sulfhydryl group of the 4'- 
phosphopantetheine portion of these molecules. These coenzymes are essential in all 
cells, participating in over 100 different intermediary reactions in cellular metabolism. 

The conventional means of synthesizing pantothenate (in particular, the bioactive 
D isomer) is via chemical synthesis from bulk chemicals, a process which is hampered 
1 5 by excessive substrate cost as well as the requirement for optical resolution of racemic 
intermediates (e.g., resolution of DL-pantolactone to obtain D-pantolactone for chemical 
condensation with P-alanine). Accordingly, researchers have recently looked to 
bacterial or microbial systems that produce enzymes useful in pantothenate biosynthesis 
processes (as bacteria are themselves capable of synthesizing pantothenate). In 
20 particular, byconversion processes have been evaluated as a means of favoring 
production of the D isomer of pantothenic acid, e.g., using microorganisms which 
selectively hydrolyze a DL-pantothenic acid ester to D-pantothenic acid; 
microorganisms which selectively decompose L-pantolactone resulting in D- 
pantolactone alone; and microorganisms which selectively hydrolyze DL-pantolactone 
25 to D-pantoic acid. 

There is still, however, significant need for improved pantothenate production 
processes, in particular, for processes requiring reduced quantities of substrates and/or 
less expensive substrates. To this end, methods of direct microbial synthesis have 
recently been examined as a means of improving D-pantothenate production. In 
30 microbes, pantothenate biosynthetis is a multistep pathway resulting in condensation of 
pantoate (derived from a-ketoisovalerate) and P-alanine to form D-pantothenate. The 
isoleucine-valine (ilv) pathway biosynthetic enzymes, acetohydroxyacid synthetase (the 
ilvBN or alsS gene product), acetohydroxyacid isomeroreductase (the ilvC gene product) 
and dihydroxyacid dehydratase (the ilvD gene product) catalyze the conversion of 
35 pyruvate to a-ketoisovalerate. The reactions are further catalyzed by the pantothenate 
(pari) pathway biosynthetic enzymes ketopantoate hydroxymethyltransferase (the panB 
gene product), ketopantoate reductase (the panE gene product), aspartate-a- 
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decarboxylase (the panD gene product) and pantothenate synthetase (the panC gene 
product). 

The genes encoding the enzymes involved in the biosynthesis of pantothenic acid 
in Salmonella typhimurium and Escherichia coli have recently been identified and 
5 characterized (Frodyma and Downs (1998) J. Biol Chem. 273:5572-5576 and 

Jackowski (1 996) pp. 687-694, In Neidhardt et al (ed.) Escherichia coli and Salmonella: 
Cellular and Molecular Biology, 2 nd ed. Am. Soc. Microbiol. Wash, D.C). In E. coli, for 
example, the biosynthesis of pantothenic acid consists of four key steps. The first 
reaction is catalyzed by the panB gene product, ketopantoate hydroxymethyltransferafce, 
10 and uses the L-valine intermediate a-ketoisovalerate to generate ketopantoate, which is 
subsequently reduced to pamoate by the paufgene product, ketopantoate reductase. 
The panD gene product, aspartate-a-decarboxylase, generates p-alanine from aspartate. 
The panC gene product, pantothenate synthetase, subsequently ligates ^-alanine with 
pamoate to yield D-pantothenate. 
1 5 The authors Dusch et al. described the identification of the Corynebacterium 

glutamicum panD gene and reported that expression of the C. glutamicum pa/iDgene in 
E. coli yielded a strain producing pantothenate with a specific productivity of 140 ng of 
pantothenate per mg (dry weight) per hour. (Dusch et al. (1999) Appl. Environ. 
Microbiol. 65:1530-1539). 
20 The authors Sahm and Eggeling have further identified the Corynebacterium 

glutamicum panB and pan C genes and have described a genetically engineered strain of 
C. glutamicum which overexpresses the panBC genes (Sahm and Eggeling (1999) Appl 
Environ. Microbiol. 65:1973-1979). The engineered strain produces pantothenate, 
however, it was necessary to overexpress the genes responsible for a-ketoisovalerate 
25 production in the host organism in order that pantothenic acid production could be 
detected. Moreover, without the addition of P-alanine, no substantial amounts of 
pantothenate accumulated with the strain constructed. 

Likewise, a method of producing D-pantothenic acid has been described that 
takes advantage of a sodium salicylate resistant mutant strain of £ coli which produces 
30 D-pantothenic acid when cultured in the presence of p-alanine (U.S. Patent No. 

5,5 1 8,906). Generation of E. coli strains resistant to a-ketoisovaleric acid and/or a- 
ketobutyric acid, and/or a-aminobutyric acid, and/or P-hydroxyaspartic acid and/or O- 
methyl-threonine, in addition to salicylic acid, further increased pantothenic acid 
production. Moreover, transformation of a plasmid DNA carrying the panB, panC and 
35 panD genes into the salicylic acid resistant mutant strain resulted in increased 
pantothenate production, however, up to 20 g/L P-alanine or more was fed in the 
examples given. The panB-panC-panD genes are clustered on the E. coli chromosome. 
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Finally, a method of producing D-pantothenic acid has been described which 
utilizes a salicylic acid-resistant, a-ketoisovalerate-resistant, a-ketobutyrate-resistant, 
hydroxyaspartate-resistant, o-methylthreonine-resistent E. coli strain transformed with 
pantothenate biosynthesis gene-containing DNA fragments and/or branched amino acid 

5 biosynthesis gene-containing DNA fragments and cultured in the presence of p-alanine 
{U.S. Patent No. 5,932,457). 

Pantothenate production in bacteria results from the condensation of pantoate 
and P-alanine and involves the pantothenate biosynthetic enzymes ketopantoate 
hydroxymethyltransferase (the panB gene product), ketopantoate reductase (the panE 

10 gene product), aspartate-a-decarboxylase (the panD gene product) and pantothenate 
synthetase (the panC gene product). Although pantothenate is biologically active as a 
vitamin, it is further metabolized in all cells to Coenzyme A (CoA) which participates as 
an acyl group carrier in the tricarboxylic acid (TCA) cycle, fatty acid metabolism and 
numerous other reactions of intermediary metabolism. The initial (and possibly rate- 

1 5 controlling) step in the conversion of pantothenate to Coenzyme A (CoA) is 

phosphorylation of pantothenate by pantothenate kinase. A pantothenate kinase activity 
was first identified in Salmonella typhimurium by screening for temperature-sensitive 
mutants which synthesized CoA at permissive temperatures but excreted pantothenate at 
non-permissive temperatures. The mutations were mapped in the Salmonella 

20 chromosome and the genetic locus was designated coaA. The gene encodes the enzyme 
that catalyzes the first step in the biosynthesis of coenzyme A from pantothenate (Dunn 
and Snell (1979) J. Bacteriol 140:805-808). Escherichia coli temperature sensitive 
mutants have also been isolated and characterized (Vallari and Rock (1987) J. Bacteriol 
169:5795-5800). These mutants (named coaA15(Ts)) are defective in the conversion of 

25 pantothenate to CoA and further exhibit a temperature-sensitive growth phenotype, 
indicating that pantothenate kinase activity is essential for growth. Moreover, it was 
noted that CoA inhibited pantothenate kinase activity to the same degree in the mutant 
as compared to the wild-type enzyme. 

Feedback resistant E. coli mutants (named coaA16(Fr)) have also been isolated 

30 that posses a pantothenate kinase activity that is refractory to feedback inhibition by 
CoA (Vallari and Jackowski (1988) J. Bacteriol 170:3961-3966). The mutation 
responsible for the reversion is, suprisingly, not genetically linked to the coaA gene by 
transduction. Additional data described therein support the view that the total cellular 
CoA content is controlled by both modulation of biosynthesis at the pantothenate kinase 

35 step and possibly by degradation of CoA to 4'-phosphopantetheine. 
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The wild-type E. coli coaA gene was cloned by functional complementation of 
E. coli temperature-sensitive mutants. The sequence of the wild-type gene was 
determined (Song and Jackowski (1992) J. Bacteriol 174:6411-6417 and Flamm et al 
(1988) Gene (Amst.) 74:555-558). Strains containing multiple copies of the coaA gene 

5 possessed 76-fold higher specific activity of pantothenate kinase, however, there was 
only a 2.7-fold increase in the steady state level of CoA (Song and Jackowski, supra). It 
has further been reported that the prokaryotic enzyme (encoded by coaA in Exoli and a 
variety of other microorganisms) is feedback inhibited by CoA both in vivo and in vitro 
with CoA being about five times more potent than acetyl-CoA in inhibiting the enzyme 

1 0 (Song and Jackowski, supra and Vallari et al , supra). Moreover, it has been reported 
that the panB gene product in E. coli is inhibited by CoA (Powers and Snell (1976) J. 
Biol Chem. 251 :3786-3793). These data further support the vieto that feedback 
inhibition of pantothenate kinase activity is a critical factor controlling intracellular CoA 
concentration. 

1 5 Using standard search and alignment tools, coaA homologues have been 

identified in Hemophilus influenzae, Mycobacterium tuberculosis, Vibrio cholerae, 
Streptococcus pyogenes and Bacillus subtilis. By contrast, proteins with significant 
similarity could not be identified in eukaryotic cells including Saccharomyces cerevisiae 
or in mammalian expressed sequence tag (EST) databases. Using a genetic selection 

20 strategy, a cDNA encoding pantothenate kinase activity has recently been identified 
from Aspergillus nidulans (Calder et al (1999) J. Biol Chem. 274:2014-2020). The 
eukaryotic pantothenate kinase gene (panK) has distinct primary structure and unique 
regulatory properties that clearly distinguish it from its prokaryotic counterpart. A 
mammalian pantothenate kinase gene (mpanKla) has also been isolated which encodes 

25 a protein having homology to the A. nidulans PanK protein and to the predicted gene 
product of GenBank™ Accession Number 927798 identified in the S. cerevisiae 
genome (Rock et al. (2000) J. Biol Chem. 275:1377-1383). 

Summary of the Invention 

30 The present invention is based, at least in part, on the discovery of key enzyme- 

encoding genes of the pantothenate biosynthetic pathway in Bacillus subtilis. In 
partipular, the present inventors have identified the panE gene of A subtilis. 
Overexpression or deregulation of the panE gcne in B. subtilis results in enhanced 
production of the /?an£gene product, ketopantoate reductase, further resulting in 

35 increased production of pantothenate. Likewise, mutations in this gene reduce 
pantothenate production in B. subtilis >90%. The present inventors have further 
identified the presumptive panBCD operon in B. subtilis, overexpression or 
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deregulation of which results in increased pantothenate production. The present 
inventors have further demonstrated that overexpression or deregulation of the panD 
gene in B. subtilis (resulting in enhanced production of the panD gene product, 
aspartate-a-decarboxylase) further results in increased production of pantothenate, in 
5 particular, in combination with deregulation of genes encoding key enzymes of the 
isoleucine-valine (ilv) biosynthetic pathway. 

Accordingly, the present invention features methods of producing pantothenate, 
as well as other compounds of the pantothenate biosynthetic pathway (e.g., 
ketopantoate, pantoate and P-alanine), termed "panto-compounds" herein, using 

10 microorganisms in which the pantothenate biosynthetic pathway and/or isoleucine- 
valine biosynthetic pathway has been manipulated such that pantothenate or other 
desired panto-compounds are produced. In one embodiment, the invention features a 
method of producing a panto-compound (e.g., pantothenate or pantoate) that involves 
culturing a microorganism which overexpresses the panE geiie product, ketopantoate 

1 5 reductase, also referred to herein as a ketogantoate reductase-overexpressing or 
"KPAR-O" microorganism, under conditions such that the panto-compound (e.g., 
pantothenate or pantoate) is produced. In another embodiment, the present invention 
features a method of producing panto-compounds (e.g., pantothenate or pantoate) 
which includes culturing a microorganism which overexpresses at least one 

20 pantothenate biosynthetic enzyme (e.g. , at least one of the panB, panC or panD gene 
products), preferably in a KPAR-0 microorganism, under conditions such that the 
panto-compound (e.g., pantothenate or pantoate) is produced. 

Yet another aspect of the invention features methods of producing panto- 
compounds which are independent of the need to feed precursors (e.g., p-alanine or 

25 aspartate and/or a-ketoisovalerate or valine). In one embodiment, the invention 

features a method of producing pantothenate in a manner independent of precursor feed 
that includes culturing an aspartate-a-decarboxylase-overexpressing (AaD-O) 
microorganism having a deregulated isoleucine-valine (/7v) pathway under conditions 
such that pantothenate is produced. In another embodiment, the invention features a 

30 method of producing pantothenate in a manner independent of precursor feed that 

includes culturing an AaD-0 microorganism having a deregulated pantothenate (pari) 
pathway and a deregulated isoleucine-valine (ilv) pathway, under conditions such that 
pantothenate is produced. In another embodiment, the invention features a method of 
producing pantothenate in a manner independent of aspartate or P-alanine feed that 

35 includes culturing an AaD-O microorganism under conditions such that pantothenate is 
produced. In another embodiment, the invention features a method of producing 
pantothenate in a manner independent of valine or a-ketoisovalerate feed that includes 
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♦ 

culturing a microorganism having a deregulated isoleucine-valine (i/v) biosynthetic 
pathway under conditions such that pantothenate is produced. In yet another 
embodiment, the invention features a high yield production method for producing , 
pantothenate that includes culturing a manipulated microorganism under conditions 
5 such that pantothenate is produced at a significantly high yield (e.g., at a level -greater 
than 10 g/L, 20 g/L, 30 g/L or 40g/L). 

The methods of the present invention further feature microorganisms that 
overexpresses acetohydroxyacid synthetase or acetohydroxyacid isomeroreductase (e.g., 
microorganisms transformed with a vector that includes an ilvBNC nucleic acid 
1 0 sequence), microorganisms that overexpresses dihydroxyacid dehydratase (e.g. , 

microorganisms transformed with a vector that includes an ilvD nucleic acid sequence), 
microorganisms that overexpresses aspartate-a-decarboxylase (e.g., microorganisms 
transformed with a vector that includes a panD nucleic acid sequence), microorganisms 
having a deregulated isoleucine-valine </7v) biosynthetic pathway and microorganisms 
1 5 having a deregulated pantothenate biosynthetic pathway (e.g. , microorganisms that 
overexpress any of ketopantoate hydroxymethyltransferase, ketopantoate reductase, 
pantothenate synthetase and aSpartate-ct-decarboxylase, for example, microorganisms 
transformed with a vector comprising a panBCD nucleic acid sequence or a vector 
comprising a panEl nucleic acid sequence). In one embodiment, the recombinant 
20 microorganism is Gram positive (e.g., microorganisms belonging to the genus Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci or Streptomyces). In another embodiment, 
the recombinant microorganism is Gram negative. Particularly preferred is a Bacillus 
recombinant microorganism (e.g., Bacillus licheniformis, Bacillus amyloliquefaciens, 
Bacillus subtilis, Bacillus pumilus, Bacillus halodurans, and the like). Recombinant 
25 vectors that contain the genes encoding Bacillus pantothenate and/or isoleucine-valine 
biosynthetic enzymes (e.g., B. subtilis pantothenate and/or isoleucine-valine biosynthetic 
enzymes) are also described. 

Also featured are methods of producing P-alanine that include culturing an 
aspartate-a-decarboxylase-overexpressing (AaD-O) microorganism under conditions 
30 such that p-alanine is produced and methods of producing p-alanine that involve 
contacting a composition comprising aspartate with an isolated Bacillus aspartate-a- 
decarboxylase enzyme under conditions such that p-alanine is produced. 

The production methods of the present invention further can include recovering 
the panto-compound <e. g. , pantothenate or pantoate). 
35 The present invention further features recombinant microorganisms (e.g. , AaD- 

O microorganisms, microorganisms having a deregulated isoleucine-valine .(i/v) 
pathway, microorganisms overexpressing at least one of ketopantoate 
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hydroxymethyltransferase (the panB gene product), pantothenate synthetase {the panC 
gene product), aspartate-a-decarboxylase (the panD gene product), ketopantoate 
reductase (the panEl gene product) and microorganisms having a deregulated panBCD 
operon. Also featured are panB, panC, panD, panE, ilvB, ilvN, alsS, ilvC, and/or ilvD 
5 nucleic acid molecules, as well as vectors including such nucleic acid molecules and 
gene products encoded by such nucleic acid molecules. 

The methodology of the present invention further includes, for example in 
addition to overexpressing at least one pantothenate biosynthetic enzyme, deleting or 
mutating a second pantothenate biosynthetic enzyme, said second pantothenate 
1 0 biosynthetic enzyme preferably being downstream of the desired product in the 
pantothenate biosynthetic pathway. For example, mutating panC, in addition to 
overexpressing the panE gene product, results in even further enhanced or increased 
production of pantoate. Accordingly, in one embodiment, the invention features a 
method of producing pantoate which includes culturing a microorganism which 
1 5 overexpresses the panE gene product and which has a deletion in the panC gene. In 
another embodiment, the invention features a method of producing pantoate which 
includes culturing a microorganism which overexpresses the panE gene product and/or 
panB gene product and which has a deletion in the panC gene. Other exemplary 
embodiments include a method of producing ketopantoate which includes culturing a 
20 microorganism which overexpresses the panB gene product and which has a deletion in 
the panE gene and a method of producing P-alanine which includes culturing a 
microorganism which overexpresses the panD gene product and which has a deletion in 
the panC gene. Also included are methods of producing panto-compounds which 
include overexpressing at least one valine biosynthetic enzyme in a microorganism 
25 which has at least one pantothenate biosynthetic enzyme deleted. 

The present invention is also based at least in part, on the identification and 
characterization of a previously unidentified microbial pantothenate kinase gene, coaX. 
CoaXv/as first identified in Bacillus subtilis and corresponds to an open reading frame 
in a portion of the chromosomal DNA that includes the 5' end of the ftsH gene, and all 
30 of the yacB, yacC, yacD, cysK and pabB genes. The present inventors have 

demonstrated that the yacB open reading frame encodes a novel pantothenate kinase 
activity, the gene being unrelated by homology to any previously known pantothenate 
kinase gene. The gene has been renamed coaX, as it encodes the enzyme which 
catalyzes the first step in the pathway from pantothenate to CoaA. 
35 Accordingly, the present invention features new and improved methods of 

producing pantothenate and other key compounds of the pantothenate biosynthetic 
pathway (e.g., panto-compounds) utilizing microorganisms having modified 
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pantothenate kinase activity. In particular, the present invention features recombinant 
microorganisms that contain the cooTgene or that contain a mutant coaVgene, having 
reduced pantothenate kinase activity. In one embodiment, the invention features such 
recombinant microorganisms further having a deregulated pantothenate biosynthetic 
5 pathway. In another embodiment, the invention features such recombinant 

microorganisms further having a deregulated isoleucine-valine (z7v) pathway. In a 
preferred embodiment, the microorganisms belong to the genus Bacillus (e.g., B. 
subtilis). 

The present invention also features recombinant microorganisms (e.g., 
1 0 microorganisms belonging to the genus Bacillus, for example, 5. subtilis) that contain 
the coaA gene or that contain a mutant coaA gene, optionally including a coaX gent or 
mutant thereof, having reduced pantothenate kinase activity. In one embodiment, the 
invention features such recombinant microorganisms further having a deregulated 
pantothenate biosynthetic pathway or having a deregulated isoleucine-valine (i/v) 
15 pathway. 

Also featured are vectors that contain isolated coaX or coaA genes as well as 
mutant coaX and/or coaA genes. Isolated nucleic acid molecules that contain isolated 
coaX genes or mutant coaX genes are featured in addition to isolated CoaX proteins and 
mutant CoaX proteins. 

20 The nucleic acids, vectors and recombinant microorganisms described above are 

particularly useful in the methodologies of the present invention. In particular, the 
invention features methods of enhancing panto-compound production <e.g, 
ketopantoate, pamoate and or pantothenate production) that include culturing a 
recombinant microorganism having a mutant coaX gene under conditions such that 

25 panto-compound production is enhanced. In one embodiment, the recombinant 
microorganism further includes a mutant coaA gene. In another embodiment, the 
recombinant microorganism further includes a mutant avtA and/or mutant ilvE gene 
and/or mutant ansB gene and/or mutant alsD gene. Also featured are methods for 
identifying pantothenate modulators utilizing the recombinant microorganisms and 

30 purified CoaX proteins of the present invention. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 
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Brief Description of the Drawings 

Figure 1 is a schematic representation of the pantothenate biosynthetic pathway. 

Figure 2 is a schematic representation of the plasmid pAN240, containing 
sequences ligated upstream of the P 2( $panBCD cassette, equivalent to the integrated 
5 version in strain P A22 1 . 

Figure 3 A is a schematic representation of the plasmid pAN004, containing the 
panBCD operon expressed from P26 and RBS1 . 

Figure 3B is a schematic representation of the plasmid pAN006, containing the 
panBCD operon expressed from P26 and RBS2. 
10 Figure 4 is a schematic representation of the plasmid pAN236, containing an 

integratable and amplifiable P 2 6-RBS2-panEl expression cassette. 

Figure 5 is a schematic representation of the construction of plasmid pAN423. 

Figure 6 is a schematic representation of the construction of plasmids pAN426 
and pAN427. 

1 5 Figure 7 is a schematic representation of the construction of plasmids pAN428 

and pAN429. 

Figure 8 is a schematic representation of the construction of plasmid pAN43 1 . 

Figure 9 is a schematic representation of the construction of plasmid pAN441 . 

Figure 10 is a schematic representation of the construction of plasmid pAN440. 
20 Figure 11 is a schematic representation of the plasmid pAN251 designed to 

integrate a single copy of a P 26 -panEl cassette at the panEl locus by double crossover. 

Figure 12 is a schematic representation of the plasmid pAN267 designed to 
integrate a single copy of a P 26 -ilvBNC cassette at the amyE locus. 

Figure 13 is a schematic representation of the plasmid pAN257, a clone of 
25 Bacillus subtilis ilvD in a low copy vector. 

Figure 14 is a schematic representation of the plasmid pAN263, designed to 
integrate a single copy of a P 26 -ilvD cassette at the ilvD locus. 

Figure 15 is a schematic representation of the plasmid pAN261 , designed to 
disrupt the Bacillus subtilis ilvD gene with the cat gene. 
30 Figure 16 is a schematic representation of the Coenzyme A biosynthetic pathway 

in E. coli. 

Figure 7 7 is a schematic representation of the structure of pAN296, a plasmid 
designed to delete most of the B. subtilis coaA gene and substitute a chloramphenicol 
resistance gene. 

35 Figure 18 is a schematic representation of the structure of the Bacillus subtilis 

genome in the region of the coaA gene. The scale is in base pairs and the significant 
open reading frames are shown by open arrows. 
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Figure 19 is a schematic representation of theplasmid pAN281, aplasmid for 
expressing Bacillus subtilis coaA after integration at the bpr locus. 

Figure 20A-B depicts a multiple sequence alignment (MSA) of the amino acid 
sequences encoded by six known or predicted microbial coaA genes. SEQ ID NOs:4-6 
5 and 1 -3 correspond to the amino acid sequences of Mycobacterium leprae (SwissProt™ 
Accession No. Q9X795), Mycobacterium tuberculosis (SwissProt™ Accession No. 
053440), Streptomyces coelicolor (SwissProt™ Accession No. 086799), Haemophilus 
influenzae (SwissProt™ Accession No. P44793), Escherichia coli SwissProt™ 
Accession No. PI 5044) and Bacillus subtilis (SwissProt™ Accession No. P54556), 
1 0 respectively. The alignment was generated using Clustal W MSA software at the 
GenomeNet CLUSTALW Server at the Institute for Chemical Research, Kyoto 
University. The following parameters were used: Pairwise Alignment, K-tuple (word) 
size = 1, Window size = 5, Gap Penalty = 3, Number of Top Diagonals = 5, Scoring 
Method = Percent; Multiple Alignment, Gap Open Penalty = 10, Gap Extension Penalty . 
1 5 = 0.0, Weight Transition = No, Hydrophilic residues = Gly, Pro, Ser, Asn, Asp, Gin, 
Glu, Arg and Lys, Hydrophobic Gaps = Yes; and Scoring Matrix = BLOSUM. 

Figure 2/ is a schematic representation of the structure of the Bacillus subtilis 
genome in the region of the coaX (yacB) gene. The scale is in base pairs, the ♦significant 
open reading frames are shown by open arrows and certain predicted restriction 
20 fragments are indicated by thick bars. 

Figure 22 is a schematic representation of the structure of pAN341 and pAN342, 
two independent PCR-derived clones of B. subtilis yacB (remaned herein as coaX), 

Figure 23A-D depicts a multiple sequence alignment (MSA) of the amino acid 
sequences encoded by fourteen known or predicted microbial coaX-genes. SEQ ID 
25 NOs:9, 74, 7-8, 75, 1 1 , 10 and 1 2-1 8 correspond to the amino acid sequences of Bacillus 
subtilis (SwissProt™ Accession No. P37564), Clostridium acetobulyticum (WIT™ 
Accession No. RCA03301, Argonne National Laboratories), Streptomyces coelicolor 
(PIR™ Accession No. T36391), Mycobacterium tuberculosis (SwissProt™ Accession 
No. 006282), Rhodobacter capsulatus (WIT™ Accession No. HRC02473), 
30 Desulfovibrio vulgaris (DBJ™ Accession No. BAA2 1 476. 1 ), Deinococcus radiodurans 
(SwissProt™ Accession No. Q9RX54), Thermotoga mar/7/ma{GenBank™ Accession 
No. AAD35964.1), Treponema pallidum {SwissProt™ Accession No. 083446), 
Borrelia burgdorferi (SwissProt™ Accession No.051477), Aquifex aeolicus 
(SwissProt™ Accession No. 067753), Synechocystis sp. (SwissProt™ Accession No. 
35 P74045), Helicobacter pylori (SwissProt™ Accession No. 025533), and Bordetella 
pertussis (SwissProt™ Accession No. Q45338), respectively. The alignment was 
generated using Clustal W MSA software at the GenomeNet CLUSTALW Server at the 
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Institute for Chemical Research, Kyoto University. The following parameters were 
used: Pairwise Alignment, K-tuple (word) size = 1, Window size = 5, Gap Penalty = 3, 
Number of Top Diagonals = 5, Scoring Method = Percent; Multiple Alignment, Gap 
Open Penalty « 10, Gap Extension Penalty = 0.0, Weight Transition = No, Hydrophilic 
5 residues = Gly, Pro, Ser, Asn, Asp, Gin, GIu, Arg and Lys, Hydrophobic Gaps = Yes; 
and Scoring Matrix = BLOSUM. 

Figure 24 depicts a multiple sequence alignment of a portion of the protein 
sequences of the coaA gene products from the following microorganisms: Bacillus 
subtilis, Escherichia coli, Haemophilus influenzae, Mycobacterium leprae, 

1 0 Mycobacterium tuberculosis, and Streptomyces coelicolor. The residues that are 

mutated in E. coli coaA15(Ts) and B. subtilis coaA282A are indicated below and above 
the alignment, respectively. The portions correspond to amino acid residues 168-1 87 of 
SEQ ID NO:3, 167-186 of SEQ IDNO:2, 165-184 of SEQ IDNO:l, 169-188 of SEQ 
IDNO:4, 169-188 of SEQ IDNO:5 and 179-198 of SEQ ID Jsl0:6, respectively. 

1 5 Figure 25 is a schematic representation of the structure of pAN294, a plasmid 

for integrating mutagenized B. subtilis coaA at its native locus. 

Figure 26 is a schematic representation of the structure of pAN336, a plasmid 
designed to delete B. subtilis coaX from its chromosomal locus and replace it with a 
kanamycin resistence gene. 

20 

Detailed Description of the Invention 

The present invention features new and improved methods of producing 
pantothenate and other key compounds of the pantothenate biosynthetic pathway 
(referred to herein as "panto-compounds", for example, pantothenate, ketopantoate, 
25 pantoate and (J-alanine) using microorganisms in which the pantothenate biosynthetic 
pathway has been manipulated such that pantothenate or other desired panto-compounds 
are produced. 

The new and improved methodologies of the present invention include methods 
of producing panto-compounds (e.g., pantothenate) in microorganisms having at least 

30 one enzyme of the pantothenate biosynthetic pathway manipulated such that 

pantothenate or other desired panto-compounds are produced (e.g., produced at an 
increased level). For example, the invention features methods of producing panto- 
compounds (e.g., pantothenate) in microorganisms having at least one of ketopantoate 
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase or aspartate- 

35 a-decarboxylase manipulated such that pantothenate or other desired panto-compounds 
are produced. The methodologies of the present invention also include methods of 
producing panto-compounds (e.g., pantothenate) in microorganisms having at least one 
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valine-isoleucine biosynthetic enzyme, described herein, manipulated such that 
pantothenate or other desired panto-compounds are produced. For example, the 
invention features methods of producing panto-compounds (e.g. , pantothenate) in ( 
microorganisms having at least one of acetohydroxyacid synthetase, acetohydroxyacid 
5 isomeroreductase or dihydroxyacid dehydratase manipulated ^uch that pantothenate or 
other desired panto-compounds are produced. 

The invention also features methods of producing panto-compounds that involve 
culturing a ketopantoate reductase-overexpressing(KPAR-O) microorganism under 
conditions such that the panto-compound is produced. The invention also features 
1 0 methods of producing pantothenate in a manner independent of precursor feed that 

involve culturing an aspartate-a-decarboxylase-overexpressing (ActD-O) microorganism 
under conditions such that pantothenate is produced. Also featured are ^-alanine 
independent high yield pantothenate production methods as well as methods of 
producing p-alanine. The present invention also features methods for enhancing 
1 5 production of panto-compounds that involve culturing pantothenate kinase mutants. In 
particular, the present invention features new and improved methods of producing 
pantothenate arid other key compounds of the pantothenate biosynthetic pathway (e.g. , 
panto-compounds) utilizing microorganisms having modified pantothenate kinase 
activity, for example, microorganisms that include the coaA^gene or that include a 
20 mutant coaX gene, having reduced pantothenate itinase activity. 

In order that the present invention may be more readily understood, certain terms 
are first defined herein. 

The term "pantothenate biosynthetic pathway" includes the biosynthetic pathway 
involving pantothenate biosynthetic enzymes (e.g. , polypeptides encoded by 
25 biosynthetic enzyme-encoding genes), compounds (e.g. , precursors, substrates, ~ 
intermediates or products), cofactors and the like utilized in the formation or synthesis 
of pantothenate. The term "pantothenate biosynthetic pathway" includes the 
biosynthetic pathway leading to the synthesis of pantothenate in a microorganisms (e.g., 
in vivo) as well as the biosynthetic pathway leading to the synthesis of pantothenate in 
30 vitro. Figure 1 includes a schematic representation of the pantothenate biosynthetic 
pathway. Pantothenate biosynthetic enzymes are depicted in bold and their 
corresponding genes indicated in italics. 

The term "pantothenate biosynthetic enzyme" includes any enzyme utilized in 
the formation of a compound (e.g., intermediate or product) of the pantothenate 
35 biosynthetic pathway. According to Figure 1 , synthesis of pantoate from a- 

ketoisovalerate (a-KJV) proceeds via the intermediate, ketopantoate. Formation of 
ketopantoate is catalyzed by the pantothenate biosynthetic enzyme ketopantoate 
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hydroxymethyltransferase (the panB gene product). Formation of pamoate is catalyzed 
by the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene 
product). Synthesis of P-alanine from aspartate is catalyzed by the pantothenate 
biosynthetic enzyme aspartate-a-decarboxylase (the panD gene product). Formation of 

5 pantothenate from pantoate and p-alanine (e.g., condensation) is catalyzed by the 
pantothenate biosynthetic enzyme pantothenate synthetase (the panC gene product). 

The term "isoleucine-valine biosynthetic pathway" includes the biosynthetic 
pathway involving isoleucine-valine biosynthetic enzymes (e.g., polypeptides encoded 
by biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates, 

10 intermediates or products), cofactors and the like utilized in the formation or synthesis 
of conversion of pyruvate to valine or isoleucine. The term "isoleucine-valine 
biosynthetic pathway" includes the biosynthetic pathway leading to the synthesis of 
valine or isoleucine in a microorganisms (e.g., in vivo) as well as the biosynthetic 
pathway leading to the synthesis of valine or isoleucine in vitro. Figure 1 includes a 

15 schematic representation of the isoleucine-valine biosynthetic pathway. Isoleucine- 
valine biosynthetic enzymes are depicted in bold italics and their corresponding genes 
indicated in italics 

The term "isoleucine-valine biosynthetic enzyme" includes any enzyme utilized 
in the formation of a compound (e.g., intermediate or product) of the isoleucine-valine 

20 biosynthetic pathway. According to Figure 1 , synthesis of valine from pyruvate 

proceeds via the intermediates, acetolactate, a,P-dihydroxyisovalerate (a,P-DHI V) and 
a-ketoisovalerate (a-KIV). Formation of acetolactate from pyruvate is catalyzed by the 
isoleucine-valine biosynthetic enzyme acetohydroxyacid synthetase (the ilvBN gene 
product, or alternatively, the alsS gene product). Formation of a,p-DHIV from 

25 acetolactate is catalyzed by the isoleucine-valine biosynthetic enzyme 

acetohydroxyacidisomero reductase (the ilvC gene product). Synthesis of a-KIV from 
a,P-DHIV is catalyzed by the isoleucine-valine biosynthetic enzyme dihydroxyacid 
dehydratase (the ilvD gene product). Moreover, valine and isoleucine can be 
interconverted by branched chain amino acid transaminases. 

30 As used herein, each of ketopantoate, pantoate, p-alanine and pantothenate are 

"panto-compounds". The term "panto-compound" includes a compound (e.g., a 
substrate, intermediate or product) in the pantothenate biosynthetic pathway which is 
downstream from a particular pantothenate biosynthetic enzyme. In one example, a 
panto-compound is downstream of the pantothenate biosynthetic enzyme ketopantoate 

35 hydroxymethyltransferase (the panB gene product) and can include ketopantoate, 

pantoate and/or pantothenate. In another example, a panto-compound is downstream of 
the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene product) 
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and can include pantoate and/or pantothenate. In yet another example, a panto- 
compound is downstream of the pantothenate biosynthetic enzyme pantothenate 
synthetase (the panC gene product) and can include pantothenate, in yet another 
example, a panto-compound is downstream of the pantothenate biosynthetic enzyme 
5 aspartate-a-decarboxylase (the panD gene product) and can include ^-alanine and/or 
pantothenate. 

Preferred panto-compounds include pantothenate and pantoate. The term 
"pantothenate" includes the free acid form of pantothenate, also referred to as 
"pantothenic acid" as well as any salt thereof {e.g. , derived by replacing the acidic • 
1 0 hydrogen of pantothenate or pantothenic acid with a cation, for example, calcium, 
sodium, potassium, ammonium), also referred to as a "pantothenate salt". The term 
"panto-compound" also includes alcohol derivatives of pantothenate. Preferred 
pantothenate salts are calcium pantothenate or sodium pantothenate. A preferred 
alcohol derivative is pantothenol. Pantothenate salts and/or alcohols of the present 
1 5 invention include salts and/or alcohols prepared v/a-conventional methods fromJhe free 
acids described herein. In another embodiment, calcium pantothenate is synthesized 
directly by a microorganism of the present invention. A pantothenate salt of the present, 
invention can likewise be converted to a free acid form of pantothenate or pantothenic 
acid by conventional methodology. 
20 The term "pantoate" includes the free acid form of pantoate, also referred to as 

"pantoic acid" as well as any salt thereof (e.g., derived by replacing the acidic hydrogen 
of pantoate or pantoic acid with a cation, for-example, calcium, sodium, potassium, 
ammonium), also referred to as a "pantoate salt". Preferred pantoate-salts are calcium 
pantoate or sodium pantoate. Pantoate salts of the present invention include salts 
25 prepared via conventional methods from the free acids described herein. A pantoate salt 
of the present invention can likewise be converted to a free acid form of pantoate or 
pantoic acid by conventional methodology. Moreover, a free acid form of pantoate or 
pantoic acid can be converted to pantolactone by conventional methodology. 

The term "CoA biosynthetic pathway" includes the biosynthetic pathway 
30 involving CoA biosynthetic enzymes {e.g. , polypeptides encoded by biosynthetic 
enzyme-encoding genes), compounds {e.g., precursors, substrates, intermediates or 
products), cofactors and the like utilized in the formation or synthesis of GoA from 
pantothenate. A schematic representation of the CoA biosynthetic pathway in E. coli is 
set forth as Figure 16. (The pathway depicted is also presumed to be that utilized by 
3 5 other microorganisms.) The term "CoA biosynthetic pathway" includes the biosynthetic 
pathway leading to the synthesis of CoA in microorganisms (e.g., in vivo) as well as the 
biosynthetic pathway leading to the synthesis of CoA in vitro. The term "Coenzyme A 



WO 01/21772 



-15- 



PCT/US00/25993 



or Co A biosynthetic enzyme" includes any enzyme utilized in the formation of a. 
compound (e.g., intermediate or product) of the CoA biosynthetic pathway, for example, 
the coaA,panK or coaX gene product which catalyzes the phosphorylation of 
pantothenate to form 4'-phosphopantothenate, or the coaD gene product which catalyzes 
5 the conversion of 4'-phosphopantetheine to dephosphocoenzyme A. 

I. Recombinant Microorganisms and Methods for Culturing 
Microorganisms Such That Panto-Compounds are Produced 

The methodologies of the present invention feature microorganisms, e.g., 

1 0 recombinant microorganisms, preferably including vectors or genes (e.g. , wild-type 
and/or mutated genes) as described herein and/or cultured in a manner which results in 
the production of a desired product (e.g. a panto-compound or panto-compounds). The 
term "recombinant" microorganism includes a microorganism (e.g., bacteria, yeast cell, 
fungal cell, etc.) which has been genetically altered, modified or engineered (e.g., 

1 5 genetically engineered) such that it exhibits an altered, modified or different genotype 
and/or phenotype (e.g., when the genetic modification affects coding nucleic acid 
sequences of the microorganism) as compared to the naturally-occurring microorganism 
from which it was derived. Preferably, a "recombinant" microorganism of the present 
invention has been genetically engineered such that it overexpresses at least one 

20 bacterial gene or gene product (e.g. , a pantothenate or isoleucine-valine biosynthetic 

enzyme encoding-gene) as described herein, preferably a biosynthetic enzyme encoding- 
gene included within a recombinant vector as described herein and/or a biosynthetic 
enzyme expressed from a recombinant vector. The ordinary skilled will appreciate that 
a microorganism expressing or overexpressing a gene product produces or overproduces 

25 the gene product as a result of expression or overexpression of nucleic acid sequences 
and/or genes encoding the gene product. 

The term "manipulated microorganism" includes a microorganism that has been 
engineered (e.g., genetically engineered) or modified such that the microorganism has at 
least one enzyme of the pantothenate biosynthetic pathway and/or at least one enzyme of 

30 the isoleucine-valine biosynthetic pathway modified such that pantothenate or other 
desired panto-compounds are produced. Modification or engineering of such 
microorganisms can be according to any methodology described herein including, but 
not limited to, deregulation of a biosynthetic pathway and/or overexpression of at least 
one biosynthetic enzyme. A "manipulated" enzyme (e.g., a "manipulated" biosynthetic 

35 enzyme) includes an enzyme, the expression or production of which has been altered or 
modified such that at least one upstream or downstream precursor, substrate or product 
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of the enzyme is altered or modified, for example, as compared to a corresponding wild- 
type or naturally occurring enzyme. 

The term "overexpiessed" or "overexpression" includes expression of a gene 
product (eg. , a pantothenate biosynthetic enzyme or isoieucine-valine biosynthetic 
5 enzyme) at a level greater than thatexpressed prior to manipulation of the 

microorganism or in a comparable microorganism which has not been manipulated. In 
one embodiment, the microorganism can be genetically manipulated (e.g., genetically 
engineered) to overexpress a level of gene product greater than that expressed prior to 
manipulation of the microorganism or in a comparable microorganism which has not - 
10 been manipulated. Genetic manipulation can include, but is not limited to, altering or 
modifying regulatory sequences or sites associated with expression of a particular gene 
{e.g., by adding strong promoters, inducible promoters or multiple promoters or by 
removing regulatory sequences such that expression is constitutive), modifying the 
chromosomal location of a particular gene, altering nucleic acid sequences adjacent to a 
1 5 particular gene such as a ribosome binding site or transcription terminator, increasing 
the copy number of a particular gene, modifying proteins {e.g. , regulatory proteins, 
suppressors, enhancers, transcriptional activators and the like) involved in transcription 
of a particular gene and/or translation of a particular gene product, or any other 
conventional means of deregulating expression of a particular gene routine in the art 
20 (including but not limited to use of antisense nucleic acid molecules, for example, to 
block expression of repressor proteins). 

In another embodiment, the microorganism can be physically or environmentally 
manipulated to overexpress a level of gene product greater than that expressed prior to 
manipulation of the microorganism or in a comparable microorganism which has not 
25 been manipulated. For example, a microorganism can be treated with or cultured in the 
presence of an agent known or suspected to increase transcription of a particular gene 
and/or translation of a particular gene product such that transcription and/or translation 
are enhanced or increased. Alternatively, a microorganism can be cultured at a 
temperature selected to increase transcription of a particular gene and/or translation of a 
30 particular gene product such that transcription and/or translation are enhanced or 
increased. 

The term "deregulated" or "deregulation" includes the alteration or modification 
of at least one gene in a microorganism that encodes an enzyme in a biosynthetic 
pathway, such that the level or activity of the biosynthetic enzyme in the microorganism 
35 is altered or modified. Preferably, at least one gene that encodes an enzyme in a 

biosynthetic pathway is altered or modified such that the gene product is enhanced or 
increased. The phrase "deregulated pathway" can also include a biosynthetic pathway in 
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which more than one gene that encodes an enzyme in a biosynthetic pathway is altered 
or modified such that the level or activity of more than one biosynthetic enzyme is 
altered or modified. The ability to "deregulate" a pathway (e.g., to simultaneously 
deregulate more than one gene in a given biosynthetic pathway) in a microorganism 
5 arises from the particular phenomenon of microorganisms in which more than one 
enzyme (e.g., two or three biosynthetic enzymes) are encoded by genes occurring 
adjacent to one another on a contiguous piece of genetic material termed an "operon". 

The term "operon" includes a coordinated unit of gene expression that contains a 
promoter and possibly a regulatory element associated with one or more, preferably at 
10 least two, structural genes (e.g., genes encoding enzymes, for example, biosynthetic 
enzymes). Expression of the structural genes can be coordinately regulated, for 
example, by regulatory proteins binding to the regulatory element or by anti-termination 
of transcription. The structural genes can be transcribed to give 1 a single mRNA that 
encodes all of the structural proteins. Due to the coordinated regulation of genes 
1 5 included in an operon, alteration or modification of the single promoter and/or 

regulatory element can result in alteration or modification of each gene product encoded 
by the operon. Alteration or modification of the regulatory element can include, but is 
not limited to removing the endogenous promoter and/or regulatory element(s), adding 
strong promoters, inducible promoters or multiple promoters or removing regulatory 
20 sequences such that expression of the gene products is modified, modifying the 

chromosomal location of the operon, altering nucleic acid sequences adjacent to the 
operon or within the operon such as a ribosome binding site, increasing the copy number 
of the operon, modifying proteins (e.g., regulatory proteins, suppressors, enhancers, 
transcriptional activators and the like) involved in transcription of the operon and/or 
25 translation of the gene products of the operon, or any other conventional means of 
deregulating expression of genes routine in the art (including but not limited to use of 
antisense nucleic acid molecules, for example, to block expression of repressor 
proteins). Deregulation can also involve altering the coding region of one or more genes 
to yield, for example, an enzyme that is feedback resistant or has a higher or lower 
30 specific activity. 

A particularly preferred "recombinant" microorganism of the present invention 
has been genetically engineered to overexpress a bacterially-derived gene or gene 
product. The term "bacterially-derived" or "derived-from", for example bacteria, 
includes a gene which is naturally found in bacteria or a gene product (e.g., ketopantoate 
35 hydroxy methyl transferase, ketopantoate reductase, pantothenate synthetase, aspartate-a- 
decarboxylate, acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase or 



WO01/21772 PCT/USOO/25993 

-18- 



dihydroxyacid dehydratase) which is encoded by a bacterial gene (e.g. encoded by 
panB, panE, panC, panD, ilvB, ilvN, alsS, ilvC, or ilvD). 

The methodologies of the present invention feature recombinant microorganisms 
which overexpress at least one of ketopantoate hydroxymethyltransferase, ketopantoate 
5 reductase, pantothenate synthetase or aspartates-decarboxylase. A particularly 
preferred recombinant microorganism of the present invention has teen genetically 
engineered to overexpress a Bacillus (e.g. , Bacillus licheniformis, Bacillus 
amyloliquefaciens, Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus, etc.) 
biosynthetic enzyme (e.g., has been engineered to overexpress at least one of B. subtilis 
1 0 ketopantoate reductase (the panE gene product) (e.g. , ketopantoate reductase having the 
amino acid sequence of SEQ ID NO.30 or encoded by the nucleic acid sequence of SEQ 
ID NO:29), B. subtilis ketopantoate hydroxymethyltransferase (the panB gene product) 
(e.g., ketopantoate hydroxymethyltransferase having the amino acid sequence of SEQ 
ID NO:24 or encoded by a nucleic acid molecule having the nucleotide sequence of 
1 5 SEQ ID NO:23), B. subtilis pantothenate synthetase (the pawC-gene product) (e.g. , 

pantothenate synthetase having the amino acid sequence of SEQ ID NO:26 or«encoded 
by a nucleic acid molecule having the nucleotidesequence of EEQ ID NO:25) and/or ft 
subtilis aspartate-a-decarboxylase (the panD gene product)<e.g., aspaitate-a- 
decarboxylase having the amino acid sequence of SEQ ID NO:28 or encoded by a 
20 nucleic acid molecule having the nucleotide sequence of SEQ ID NO:27). 

In an exemplary embodiment, the invention features a microorganism (e.g. , a 
KPAR-0 microorganism) that has been transformed with a vector comprising a panE 
nucleic acid sequence (e.g., a panE nucleic acid seqeunce as set forth in SEQ ID 
NO:29). In another embodiment, the invention features a microorganism that has been 
25 transformed with a vector comprising a panB nucleic acid sequence (e.g. , a panB nucleic 
acid sequence as set forth in SEQ ID NO:23), a vector comprising apanC nucleic acid 
sequence {e.g., a panC nucleic acid sequence as set forth in SEQ ID NO:25) or a vector 
comprising a panD nucleic acid sequence ie.g, a panD nucleic acid sequence as set 
forth in SEQ ID NO:27). In yet another embodiment, the invention features a 
30 microorganism having a deregulated panBCD operon (e.g. , SEQ ID NO:59). 

Other preferred "recombinant" microorganisms of the present invention have a 
deregulated isoleucine-valine (/7v) pathway. The phrase "microorganism having a 
deregulated isoleucine-valine (/7v) pathway" includes a microorganism having an 
alteration or modification in at least one gene encoding an enzyme of the isoleucine- 
35 valine (i/v) pathway or having an alteration or modification in an operon including more 
than one gene encoding an enzyme of the isoleucine-valine (/7v) pathway. A preferred 
"microorganism having a deregulated isoleucine-valine (/7v) pathway" has been 
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genetically engineered to overexpress a Bacillus {e.g., B. subtilis) ilv biosynthetic 
enzyme {e.g., has been engineered to overexpress at least one of acetohydroxyacid 
synthetase (the ilvBN gene products or the alsS gene product) (e.g., acetohydroxyacid 
synthetase having subunits having the amino acid sequences of SEQ ID NO:32 and £EQ 
5 ID NO:34 or encoded by nucleic acid molecules having the nucleotide sequence of SEQ 
ID NO:31 and SEQ ID NO:33 or the nucleotide sequence of SEQ ID NO:58 from 
nucleotides 1-2246 or acetohydroxyacid synthetase encoded by a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:86), acetohydroxyacid isomeroreductase 
(the ilvC gene product) (e.g., acetohydroxyacid isomeroreductase having the amino acid 

1 0 sequence of SEQ ID NO:36 or encoded by a nucleic acid molecule having the 

nucleotide sequence of SEQ ID NO:35), dihydroxyacid dehydratase (the ilvD gene 
product) (e.g., dihydroxyacid dehydratase having the amino acid sequence of SEQ ID 
NO:38 or encoded by a nucleic acid molecule having the nucleotide sequence of SEQ 
ID NO:37), and/or has been transformed with a vector comprising an ilvBNC nucleic 

1 5 acid sequence (SEQ ID NO:58, coding regions from nucleotides 1 - 1 725, 1 722-2246 and 
2263-3291) and/or an ilvD nucleic acid sequence (SEQ ID NO:37). 

In another preferred embodiment, a recombinant microorganism is designed or 
engineered such that a mutant CoaA and/or CoaX biosynthetic enzyme is expressed and 
at least one pantothenate biosynthetic enzyme and/or at least one isoleucine-valine 

20 biosynthetic enzyme is overexpressed or deregulated. 

In another preferred embodiment, a microorganism of the present invention 
overexpresses or is mutated for a gene or biosynthetic enzyme (e.g., a Co A biosynthetic 
enzyme, pantothenate biosynthetic enzyme or isoleucine-valine biosynthetic enzyme) 
which is bacterially-derived. The term "bacterially-derived" or "derived-from", for 

25 example bacteria, includes a gene product (e.g., ketopantoate hydroxymethyltransferase, 
ketopantoate reductase, pantothenate synthetase, aspartate-ct-decarboxylate, 
acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase, dihydroxyacid 
dehydratase or pantothenate kinase) which is encoded by a bacterial gene (e.g.,panB, 
panE, panC,panD, ilvBN (or alsS), ilvC, ilvD, or encoded by coaA or coaX). 

30 Still other preferred recombinant microorganisms of the present invention are 

mutant microorganisms. As used herein, the term "mutant microorganism" includes a 
recombinant microorganism that has been genetically engineered to express a mutated 
gene or protein that is normally or naturally expressed by the microorganism. 
Preferably, a mutant microorganism expresses a mutated gene or protein such that the 

35 microorganism exhibits an altered, modified or different phenotype (e.g., has been 

engineered to express a mutated CoaA biosynthetic enzyme, for example, pantothenate 
kinase). In one embodiment, a mutant microorganism is designed or engineered such 
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that it includes a mutant coaX gene, as defined herein. In another embodiment, a 
recombinant microorganism is designed or -engineered such that it includes a mutant 
coaA gene, as defined herein. In another embodiment, a mutant microorganism is 
designed or engineered such that a coaXgene has been deleted (i.e., the protein -encoded 
5 by the coo/f gene is not produced). In another embodiment, a mutant microorganism is 
designed or engineered such that a coaA gene has been deleted (i.e., the protein -encoded 
by the coaA gene is not produced). Preferably, a mutant microorganism has a mutant 
coaX gene or a mutant coaA gene, or has been engineered to have a coaX gene and/or 
coaA deleted, such that that the mutant microorganism encodes a "reduced pantothenate 
10 kinase activity". In the context of a whole microorganism, a "reduced pantothenate 
kinase activity" can be determined by measuring or assaying for a decrease in an 
intermediate or product of the CoA biosynthetic pathway, for example, measuring or 
assaying for 4'-phosphopantothenate, 4'-phosphopantothenylcysteine, 4'- 
.phosphopantetheine, dephosphocoenzyme A, Coenzyme A, apo-acyl carrier protein 
1 5 (apo-ACP) or holo-acyl carrier protein (ACP) in the microorganism {e.g. , in a lysate 
isolated or derived from the microorganism) or in the medium in which the 
microorganism is cultured (see e.g., Figure 16). Alternatively, a "reduced pantothenate , 
kinase activity" can be determined by measuring or assaying for ^decreased growth of the 
microorganism. Alternatively, a "reduced pantothenate kinase activity" can be 
20 determined by measuring or assaying for an increase in a panto-compound {e.g. , 
pantothenate) in the microorganism or surrounding media, as panto-compounds lie 
upstream of the CoA biosynthetic pathway, the first step of which is catalyzed by 
pantothenate kinase. The invention also features recombinant microorganisms that, in 
addition to having reduced pantothenate kinase activity (e.g., expressing mutant <oaA 
25 and/or mutant coaX genes) have a deregulated pantothenate biosynthesis pathway and/or 
a deregulated isoleucine-valine (i/v) biosynthetic pathway. 

In one embodiment, a recombinant microorganism of the present invention is a 
Gram positive organism {e.g., a microorganism which retains basic dye, for example, 
crystal violet, due to the presence of a Gram-positive wall surrounding the 
30 microorganism). In a preferred embodiment, the recombinant microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci and Streptomyces. In a more preferred 
embodiment, the recombinant microorganism is of the genus Bacillus. In another 
preferred embodiment, the recombinant microorganism is selected from the group 
35 consisting of Bacillus sublilis, Bacillus lentimorbus, Bacillus lentus, Bacillus firmus, 
Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus circulans, 
Bacillus coagulans, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, 
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Bacillus thuringiensis, and other Group 1 Bacillus species, for example, as characterized 
by 16S rRNA type (Priest (1993) in Bacillus subtilis and Other Gram-Positive Bacteria 
eds. Sonenshein et at., ASM, Washington, D.C., p. 6). In another preferred 
embodiment, the recombinant microorganism is Bacillus brevis or Bacillus 
5 stearothermophilus. In another preferred embodiment, the recombinant microorganism 
is selected from the group consisting of Bacillus licheniformis, Bacillus 
amyloliquefaciens, Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus. 

In another embodiment, the recombinant microorganism is a Gram negative 
(excludes basic dye) organism. In a preferred embodiment, the recombinant 

1 0 microorganism is a microorganism belonging to a genus selected from the group 
consisting of Salmonella, Escherichia, Klebsiella, Serratia t and Proteus. In a more 
preferred embodiment, the recombinant microorganism is of the genus Escherichia. In 
an even more preferred embodiment, the recombinant microorganism is Escherichia 
coli. In another embodiment, the recombinant microorganism is Saccharomyces {e.g., S. 

1 5 cerevisiae). 

An important aspect of the present invention involves culturing the recombinant 
microorganisms described herein, such that a desired compound (e.g., a desired panto- 
compound) is produced. The term "culturing" includes maintaining and/or growing a 
living microorganism of the present invention {e.g., maintaining and/or growing a 

20 culture or strain). In one embodiment, a microorganism of the invention is cultured in 
liquid media. In another embodiment, a microorganism of the invention is cultured in 
solid media or semi-solid media. In a preferred embodiment, a microorganism of the 
invention is cultured in media {e.g., a sterile, liquid media) comprising nutrients 
essential or beneficial to the maintenance and/or growth of the microorganism {e.g., 

25 carbon sources or carbon substrate, for example complex carbohydrates such as bean or 
grain meal, starches, sugars, sugar alcohols, hydrocarbons, oils, fats, fatty acids, organic 
acids and alcohols; nitrogen sources, for example, vegetable proteins, peptones, peptides 
and amino acids derived from grains, beans and tubers, proteins, peptides and amino 
acids derived form animal sources such as meat, milk and animal byproducts such as 

30 peptones, meat extracts and casein hydrolysates; inorganic nitrogen sources such as 
urea, ammonium sulfate, ammonium chloride, ammonium nitrate and ammonium 
phosphate; phosphorus sources, for example, phosphoric acid, sodium and potassium 
salts thereof; trace elements, for example, magnesium, iron, manganese, calcium, 
copper, zinc, boron, molybdenum, and/or cobalt salts; as well as growth factors such as 

35 amino acids, vitamins, growth promoters and the like). 
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Preferably, microorganisms of the present invention are cultured under 
controlled pH. The term "controlled pH" includes any pH which results in production of 
the desired product (e.g. , a panto-compound). In one -embodiment, microorganisms are 
cultured at a pH of about 7. In another embodiment, microorganisms are cultured at a 
5 pH of between 6.0 and 8.5. The desired pH may be maintained by.any number of 
methods known to those skilled in the art. 

Also preferably, microorganisms of the present invention are cultured under 
controlled aeration. The term "controlled aeration" includes sufficient aeration <e.g., 
oxygen) to result in production of the desired product (e.g., panto-compound). In one 
1 0 embodiment, aeration is controlled by regulating oxygen levels in the culture, for 
example, by regulating the amount of oxygen dissolved in culture media. Preferably, 
aeration of the culture is controlled by agitating the culture. Agitation may be provided 
by a propeller or similar mechanical agitation equipment, by revolving or*shaking the 
growth vessel (e.g., fermentor) or by various pumping equipment. Aeration may be 
1 5 further controlled by the passage of sterile air or oxygen through the medium <e.g., 
through the fermentation mixture). Also preferably, microorganisms of the present 
invention are cultured without excess foaming {e.-g., via addition of antifoaming agents). 

Moreover, microorganisms of the present invention can be cultured under 
controlled temperatures. The term "controlled temperature" includes any temperature 
20 which results in production of the desired product (e.g., a panto-compound). In one 
embodiment, controlled temperatures include temperatures between 15°C and 95°C. In 
another embodiment, controlled temperatures include temperatures between 15°C and 
70°C. Preferred temperatures are between 20°C and 55°C, more preferably between 
30°C and 45°C or between 30°C and 50°C. 
25 Microorganisms can be cultured „(e.g. , maintained and/or grown) in liquid media 

and preferably are cultured, either continuously or intermittently, by conventional 
culturing methods such as standing culture, test tube culture, shaking culture<e.g., rotary 
shaking culture, shake flask culture, etc.), aeration spinner culture, or fermentation. In a 
preferred embodiment, the microorganisms are cultured in shake flasks. In a more 
30 preferred embodiment, the microorganisms are cultured in a fermentor (e.g., a 

fermentation process). Fermentation processes of the present invention include, but are 
not limited to, batch, fed-batch and continuous methods of fermentation. The phrase 
"batch process" or "batch fermentation" refers to a closed system in which the 
composition of media, nutrients, supplemental additives and the like is set at the 
35 beginning of the fermentation and not subject to alteration during the fermentation, 

however, attempts may be made to control -such factors as pH and oxygen concentration 
to prevent excess media acidification and/or microorganism death. The phrase "fed- 
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batch process" or "fed-batch" fermentation refers to a batch fermentation with the 
exception that one or more substrates or supplements are added (e.g., added in 
increments or continuously) as the fermentation progresses. The phrase "continuous 
process" or "continuous fermentation" refers to a system in which a defined 
5 fermentation media is added continuously to a fermentor and an equal amount of used or 
"conditioned" media is simultaneously removed, preferably for recovery of the desired 
product (e.g., panto-compound). A variety of such processes have been developed and 
are well-known in the art. 

The phrase "culturing under conditions such that a desired compound (e.g., a 

10 panto-compound, for example, pantothenate) is produced" includes maintaining and/or 
growing microorganisms under conditions (e.g., temperature, pressure, pH, duration, 
etc.) appropriate or sufficient to obtain production of the desired compound or to obtain 
desired yields of the particular compound being produced. For example, culturing is 
continued for a time sufficient to produce the desired amount of a panto-compound (e.g., 

1 5 , pantothenate, pantoate or ^-alanine). Preferably, culturing is continued for a time 
sufficient to substantially reach maximal production of the panto-compound. In one 
embodiment, culturing is continued for about 12 to 24 hours. In another embodiment, 
culturing is continued for about 24 to 36 hours, 36 to 48 hours,, 48 to 72 hours, 72 to 96 
hours, 96 to 120 hours, 120 to 144 hours, or greater than 144 hours. In another 

20 embodiment, culturing is continued for a time sufficient to reach production yields of 
panto-compound, for example, cells are cultured such that at least about 15 to 20 g/L of 
panto-compound are produced, at least about 20 to 25 g/L panto-compound are 
produced, at least about 25 to 30 g/L panto-compound are produced, at least about 30 to 
35 g/L panto-compound are produced, at least about 35 to 40 g/L panto-compound are 

25 produced (e.g., at least about 37 g/L panto-compound) or at least about 40 to 50 g/L 
panto compound are produced. In yet another embodiment, microorganisms are 
cultured under conditions such that a preferred yield of panto-compound, for example, a 
yield within a range set forth above, is produced in about 24 hours, in about 36 hours, in 
about 48 hours, in about 72 hours, or in about 96 hours. 

30 The methodology of the present invention can further include a step of 

recovering a desired compound (e.g., a panto-compound). The term "recovering" a 
desired compound (e.g., a panto-compound) includes extracting, harvesting, isolating or 
purifying the compound from culture media. Recovering the compound can be 
performed according to any conventional isolation or purification methodology known 

35 in the art including, but not limited to, treatment with a conventional resin (e.g., anion or 
cation exchange resin, non-ionic adsorption resin, etc.), treatment with a conventional 
adsorbent (e.g., activated charcoal, silicic acid, silica gel, cellulose, alumina, etc.), 
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alteration of pH, solvent extraction (e.g., with a conventional solvent such as an alcohol, 
ethyl acetate, hexane and the like), dialysis, filtration, concentration, crystallization, 
recrystallization, pH adjustment, lyophilization and the iike. For example, a compound 
(e.g., a panto-compound) can be recovered from culture media by first removing the 
5 microorganisms from the culture. Media is then passed through or over a cation 

exchange resin to remove unwanted cations and then through or over an anion exchange 
resin to remove unwanted inorganic anions and organic acids having stronger acidities 
than the panto-compound of interest (e.g., pantothenate). The resulting panto-compound 
(e.g., pantothenate) can subsequently be converted to a pantothenate salt (e.g., calcium 
1 0 pantothenate) as described herein. 

Preferably, a desired compound of the present invention is "extracted", 
"isolated" or "purified" such that the resulting preparation is substantially free of other 
components (e.g., free of media components and/or fermentation byproducts). The 
language "substantially free of other components" includes preparations of desired 
1 5 compound in which the compound is-separated ie.g. , purified or partially purified) from 
media components or fermentation byproducts of the culture from which it is produced. 
In one embodiment, the preparation has greater than about 80% (by dry weight) of the 
desired compound (e.g., less than about 20% of other media components or fermentation 
byproducts), more preferably greater than about 90% of the desired compound (e.g., less 
20 than about 1 0% of other media components or fermentation byproducts), still more 

preferably greater than about 95% of the desired compound (e.g. , less than about 5% of 
other media components or fermentation byproducts), and most preferably greater than 
about 98-99% desired compound (e.g., less than about 1-2% other media components or 
fermentation byproducts). When the desired compound is a panto-compound that has 
25 been derivatized to a salt (e.g. a pantothenate salt or pantoate salt), the?panto-compound 
is preferably further free (e.g., substantially free) of chemical contaminants associated 
with the formation of the salt. When the desired compound is a pantorcompound that 
has been derivatized to an alcohol, the panto-compound is preferably further free (e.g. , 
substantially free) of chemical contaminants associated with the formation of the 
30 alcohol. 

In an alternative embodiment, the desired panto-compound is not purified fr-om 
the microorganism, for example, when the microorganism is biologically non-hazardous 
(e.g., safe). For example, the entire culture (or culture supernatant) can be used as a 
source of product (e.g., crude product). In one embodiment, the culture (or culture 
35 supernatant) supernatant is used without modification. In another embodiment, the 
culture (or culture supernatant) is concentrated. In yet another embodiment, the -culture 
(or culture supernatant) is dried or lyophilized. 
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II Panto-Compound Production Methodologies Featuring Ketopantoate 
Reductase-Overexpressing Microorganisms 

One aspect of the invention features methods of producing a panto-compounds 
that involve culturing a ketopantoate reductase-overexpressing (KPAR-O) 
5 microorganism under conditions such that the panto-compound is produced. The term 
"ketopantoate reductase-overexpressing (KPAR-O) microorganism" includes a 
microorganism which has been manipulated such that ketopantoate reductase is 
overexpressed (e.g., a B. subtilis ketopantoate reductase protein having the amino acid 
sequence of SEQ ID NO:30) and/or has been transformed with a vector comprising a 

1 0 panEI nucleic acid sequence (e.g., a B. subtilis panEl nucleic acid sequence as set forth 
in SEQ ID NO:29). In one embodiment, the panto-compound is pantothenate. In 
another embodiment, the panto-compound is pamoate. In another embodiment, the 
ketopantoate reductase is bacterial-derived. In another embodiemnt, the ketopantoate 
reductase is derived from Bacillus (e.g., is derived from Bacilfus subtilis). In yet 

1 5 another embodiment, the KPAR-O microorganism is Gram positive. In yet another 
embodiment, the KPAR-O microorganism is a microorganism belonging to a genus 
selected from the group consisting of Bacillus, Corny ebacterium, Lactobacillus, 
Lactococci and Streptomyces. In a preferred embodiemnt, the KPAR-O microorganism 
is of the genus Bacillus. In a more preferred embodiment, the KPAR-O microorganism 

20 is selected from the group consisting of Bacillus licheniformis, Bacillus 

amyloliquefaciens, Bacillus halodurans, Bacillus subtilis and Bacillus pumilus. In a 
particularly preferred embodiemnt, the KPAR-O microorganism is Bacillus subtilis. 

In still other embodiments, the KPAR-O microorganism further overexpresses at 
least one pantothenate biosynthetic enzyme in addition to ketopantoate reductase. In an 

25 exemplary embodiment, the KPAR-O microorganism further overexpresses at least one 
of ketopantoate hydroxymethyltransferase, pantothenate synthetase and aspartate-a- 
decarboxylase. Also featured are methods of producing panto-compounds, for example, 
methods that involve culturing a KPAR-O microorganism, which further include the 
step of recovering the panto-compound. 

30 

III Methods of Producing Panto-Compounds Independent of Precursor Feed 
Requirements 

Depending on the biosynthetic enzyme or combination of biosynthetic enzymes 
manipulated, it may be desirable or necessary to provide (e.g., feed) microorganisms of 
35 the present invention at least one pantothenate biosynthetic precursor such that 

pantothenate or other desired panto-compounds are produced. The term "pantothenate 
biosynthetic precursor" or "precursor" includes an agent or compound which, when 
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provided to, brought into contact with, or included in the culture medium of a 
microorganism, serves to enhance or increase pantothenate biosynthesis. In one 
embodiment, the pantothenate biosynthetic precursor or precursor is aspartate. In 
another embodiment, the pantothenate biosynthetic precursor or precursor is«p-alanine. 
5 The amount of aspartate or {J-alanine added is preferably an amount that results in a 
concentration in the culture medium sufficient to enhance productivity of the 
microorganism (e.g., a concentration sufficient to enhance production of a panto- 
compound, for example, P-alanine, ketopantoate, pantoate or pantothenate). 
Pantothenate biosynthetic precursors of the present invention can be added in the form 
10 of a concentrated solution or suspension (e.g. , in a suitable solvent such as water or 

buffer) or in the form of a solid (e.g., in the form of a powder). Moreover, pantothenate 
biosynthetic precursors of the present invention can be added as a single aliquot, 
continuously or intermittently over a given period of time. 

In yet another embodiment, the pantothenate biosynthetic precursor is valine, see 
1 5 e.g. , Example HI. In yet another embodiment, the pantothenate biosynthetic precursor is 
a-ketoisovalerate. Preferably, valine or a-ketoisovalerate is added in an amount that 
results in a concentration in the medium sufficient for production of the desired product 
(e.g., panto-compound) to occur. Pantothenate biosynthetic precursors are also referred 
to herein as "supplemental pantothenate biosynthetic substrates". 
20 Providing pantothenate biosynthetic precursors in the pantothenate biosynthetic 

methodologies of the present invention,-can be associated with high costs, for example, 
when the methodologies are used to produce high yields of panto-compounds. 
Accordingly, preferred methodologies of the present invention feature microorganisms 
having at least one biosynthetic enzyme or combination of biosynthetic -enzymes (e.g., at 
25 least one pantothenate biosynthetic enzyme and/or valine-isoleucine biosynthetic 
enzyme) manipulated such that pantothenate or other desired panto-compounds are : 
produced in a manner independent of precursor feed. The phrase "a manner 
independent of precursor feed", for example, when referring to a method for producing a 
desired compound (e.g., a panto-compound), includes an approach to or a mode of 
30 producing the desired compound that does not depend or rely on precursors being 
provided (e.g., fed) to the microorganism being utilized to produce the desired 
compound. For example, microorganisms featured in the methodologies of the present 
invention can be used to produce panto-compounds in a manner requiring no feeding of 
the precursors aspartate, p-alanine, valine and/or a-KIV. 
35 Alternative preferred methodologies of the present invention feature 

microorganisms having at least one biosynthetic enzyme or combination of biosynthetic 
enzymes manipulated ^such that pantothenate or other desired panto-compounds are 
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produced in a manner substantially independent of precursor feed. The phrase "a 
manner substantially independent of precursor feed" includes an approach to or a 
method of producing the desired compound that depends or relies to a lesser extent on 
precursors being provided (e.g., fed) to the microorganism being utilized. For example, 
5 microorganisms featured in the methodologies of the present invention can be used to 
produce panto-compounds in a manner requiring feeding of substantially reduced 
amounts of the precursors aspartate, P-alanine, valine and/or a-KIV. In one 
embodiment, the invention features methods of producing panto-compounds (e.g., 
pantothenate) in a manner that requires feeding of less than 5%-10% of the amount of 

10 precursor required by a control microorganism (e.g., a microorganism that is dependent, 
for example is wholly dependent, on precursor feed to efficiently produce the desired 
compound). In another embodiment, the invention features methods of producing 
panto-compounds in a manner that requires feeding of less than 15-20% of the amount 
of precursor required by a control microorganism. In another embodiment, the 

1 5 , invention features methods of producing panto-compounds in a manner that requires 
feeding of less than 25-30%, 35-40%, 45-50% or 55-60% of the amount of precursor 
required by a control microorganism. As described in Examples I-III herein, particular 
microorganisms featured in the methodologies of the present invention require, for 
example, 5 g/L of aspartate, p-alanine, valine or a-KIV (e.g. , in test tube or in shake 

20 flask cultures). Accordingly, in a preferred embodiment, the present invention features 
methods of producing panto-compounds (e.g., pantothenate) in a manner requiring 
feeding of less than 0.25 g/L, 0.5 g/L, 0.75 g/L, 1 g/L, 1.25 g/L, 1.5 g/L, 1.75 g/L, 2 g/L, 
2.25 g/L, 2.5 g/L, 2.75 g/L or 3 g/L. 

Preferred methods of producing desired compounds (e.g., panto-compounds) in a 

25 manner independent of precursor feed or alternatively, in a manner substantially 
independent of precursor feed, involve culturing microorganisms which have been 
manipulated (e.g., designed or engineered, for example, genetically engineered) such 
that expression of at least one pantothenate biosynthetic enzyme, and/or at least one 
isoleucine- valine biosynthetic enzyme, is modified. For example, in one embodiment, a 

30 microorganism is manipulated (e.g. , designed or engineered) such that the production of 
at least one pantothenate biosynthetic enzyme, and/or at least one isoleucine/valine 
biosynthetic enzyme is deregulated. In a preferred embodiment, a microorganism is 
manipulated (e.g., designed or engineered) such that it has a deregulated biosynthetic 
pathway, for example, a deregulated pantothenate biosynthesis pathway and/or a 

35 deregulated isoleucine-valine biosynthetic pathway, as defined herein. In another 
preferred embodiment, a microorganism is manipulated (e.g., designed or engineered) 
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such that at least one pantothenate biosynthetic enzyme, and/or at least one isoleucine- 
valine biosynthetic enzyme is overexpressed. 

Preferred methods of producing desired compounds <«.g., panto-compounds) in a 
manner independent of precursor feed or alternatively, in a manner substantially 
5 independent of precursor feed, are as follows. In one embodiment, the invention 

features a method of producing pantothenate in a manner independent of precursor feed 
comprising culturing an aspartate-a-decarboxylase-overexpressing (AaD-O) 
microorganism having a deregulated isoleucine-valine {i/v) pathway under conditions 
such that pantothenate is produced. In another embodiment, the invention features a 
1 0 method of producing pantothenate in a manner independent of precursor feed 
comprising culturing an aspartate-a-decarboxylase-overexpressing (AaD-O) 
microorganism having a deregulated pantothenate (pan) pathway'and a deregulated 
isoleucine-valine (;7v) pathway, under conditions such that pantothenate is produced. In 
another embodiment, the invention features a method of producing pantothenate in a 
1 5 manner independent of aspartate or p-alanine feed comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that 
pantothenate is produced. In yet another embodiment, the invention features a method 
of producing pantothenate in a manner independent of valine or a-ketoisovalerate 'feed 
comprising culturing a microorganism having a deregulated isoleucine-valine <//v) 
20 biosynthetic pathway under conditions such that pantothenate is produced. 

The term "aspartate-a-decarboxylase-overexpressing (AaD-O) microorganism" 
includes a microorganism which has been manipulated such that aspartate-a- 
decarboxylase is overexpressed. A preferred "aspartate-a-decarboxylase-overexpressing 
(AaD-O) microorganism" has been transformed with a vector comprising a B. subtilis 
25 panD nucleic acid sequence {e.g. , a panD nucleic acid sequence that encodes an 

aspartate-a-decarboxylase protein having the amino acid sequence of SEQ ID NO:28, 
for example, a panD nucleic acid sequence as set forth inSEQ ID NO:27). 

The phrase "microorganism having a deregulated tsoleucine-va1ine<j/v) 
pathway" includes a microorganism having an alteration or modification in at least one 
30 gene encoding an enzyme of the isoleucine-valine (tfv) pathway or having an alteration 
or modification in an operon including more than one-gene encoding an-enzyme of the 
isoleucine-valine (i/v) pathway. A preferred "microorganism having a deregulated 
isoleucine-valine<i7v) pathway" overexpresses aeetohydroxyacid synthetase (e.g., 
acetohydroxyacid synthetase having subunits having the amino acid sequences of SEQ 
35 ID NO:32 and SEQ ID NO:34 or acetohydroxyacid synthetase having the amino acid 
sequence of SEQ ID NO:87), acetohydroxyacid isomeroreductase {having the amino 
acid sequence of SEQ ID NO:36), or dihydroxyacid dehydratase (having the amino acid 
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sequence of SEQ ID NO:38) anoVor has been transformed with a vector comprising ilvB, 
ilvN, ilvC, ilvBN, ilvBNC or alsS nucleic acid sequences (SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:35, nucleotides 1-2246 of SEQ ID NO:58, SEQ ID NO:58 having 
coding regions from nucleotides 1-1725, 1722-2246 and 2263-3291, or SEQ ID NO:86, 
5 respectively) and/or an UvD nucleic acid sequence (SEQ ID NO:37). 

IV. High Yield Production Methodologies 

A particularly preferred embodiment of the present invention is a high yield 
production method for producing pantothenate comprising culturing a manipulated 

10 microorganism under conditions such that pantothenate is produced at a significantly 
high yield. The phrase "high yield production method", for example, a high yield 
production method for producing a desired compound (e.g., for producing a panto- 
compound) includes a method that results in production of the desired compound at a 
level which is elevated or above what is usual for comparable production methods. 

1 5 Preferably, a high yield production method results in production of the desired 

compound at a significantly high yield. The phrase "significantly high yield" includes a 
level of production or yield which is sufficiently elevated or above what is usual for 
comparable production methods, for example, which is elevated to a level sufficient for 
commercial production of the desired product (e.g., production of the product at a 

20 commercially feasible cost). In one embodiment, the invention features a high yield 
production method of producing pantothenate that includes culturing a manipulated 
microorganism under conditions such that pantothenate is produced at a level greater 
than 2 g/L. In another embodiment, the invention features a high yield production 
method of producing pantothenate that includes culturing a manipulated microorganism 

25 under conditions such that pantothenate is produced at a level greater than 1 0 g/L. In 
another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 20 g/L. In yet 
another embodiment, the invention features a high yield production method of 

30 producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 30 g/L. In yet 
another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 40 g/L. 

35 The invention further features a high yield production method for producing a 

desired compound (e.g., for producing a panto-compound) that involves culturing a 
manipulated microorganism under conditions such that a sufficiently elevated level of 
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compound is produced within a commercially desireable period of time, in an 
exemplary embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 15-20 g/L in 36 
5 hours. In another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 25-30^/L in 48 
hours. In another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
10 conditions such that pantothenate is produced at a level greater than 35-40 g/L in 72 
hours, for example, greater that 37 g/L in 72 hours. In another-embodiment, the 
invention features a high yield production method of producing pantothenate that 
includes culturing a manipulated microorganism under conditions *uch that pantothenate 
is produced at a level greater than 30-40 g/L in 60 hours, for example, greater that 30, 35 
15 or 40 g/L in 60 hours. Values and ranges included and/or intermediate within the ranges 
set forth herein are also intended to be within the scope of the present invention. For 
example, pantothenate production at levels of at least 3 1 , 32, 33, 34, 35, 36, 37, 38 and 
39 g/L in 60 hours are intended to be included within the range of 30-40 g/L in 60 hours. 
In another example, ranges of 30-35 g/L or 35-40 g/L are intended to be included within 
20 the range of 30-40 g/L in 60 hours. Moreover, the skilled artisan will appreciate that 
culturing a manipulated microorganism to achieve a production level of, for example, 
"30-40 g/L in 60 hours" includes culturing the microorganism for additional time 
periods (e.g., time periods longer than 60 hours), optionally resulting in even higher 
yields of pantothenate being produced. 

25 

V. Panto-Compound Production Methodologies Featuring Pantothenate. 
Kinase Mutant Microorganisms 

The present invention relates to methods of producing pantothenate using 
microorganisms engineered to produce high yields of pantothenate as well as other 
30 panto-compounds. Cells overproducing pantothenate result in high intracellular 

pantothenate levels that could overcome the feedback inhibition of pantothenate kinase 
by CoA, leading to overproduction of CoA. Besides consuming pantothenate, increased 
synthesis of CoA may cause increased feedback inhibition of the PanB, PanD, PanE or 
PanC reaction, thereby limiting pantothenate production. Accordingly, a reduction in 
35 pantothenate kinase activity may lead to a decrease in CoA levels with resulting 
increases in PanB, PanD, PanE or PanC activity and pantothenate production. 
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Thus, certain methodologies of the present invention are based, at least in part, 
on the identification and characterization of the B. subtilis coaA gene and the 
demonstration that the gene is neither essential for B. subtilis growth deletion of the 
coaA gene from the chromosome of B. subtilis is not a lethal event) nor for pantothenate 
5 kinase activity in B. subtilis. A second pantothenate kinase-encoding gene has been 
identified and characterized in B. subtilis, and is termed "coaA*\ This gene 
complements an E. coli mutant that contains a temperature sensitive pantothenate kinase 
and is not related by homology to any previously known pantothenate kinase gene. 
In one aspect, the methodologies of the invention feature recombinant 
1 0 microorganisms that include the coaX gene or that include a mutant coaX gene, having 
reduced pantothenate kinase activity. In one embodiment, the methodologies feature 
such recombinant microorganisms further having a deregulated pantothenate 
biosynthetic pathway. In another embodiment, the methodologies feature such ♦ 
recombinant microorganisms further having a deregulated isoleucine-valine {ilv) 
1 5 pathway. In a preferred embodiment, the microorganisms belong to the genus Bacillus 
(e.g., B. subtilis). 

The methodologies of the invention also feature recombinant microorganisms 
(eg., microorganisms belong to the genus Bacillus, for example, B. subtilis) that 
include the coaA gene or that include a mutant coaA gene, optionally including a coaX 
20 gene or mutant thereof, having reduced pantothenate kinase activity. In one 

embodiment, the methodologies feature such recombinant microorganisms further 
having a deregulated pantothenate biosynthetic pathway or having a deregulated 
isoleucine-valine (ilv) pathway. Also featured are vectors that include isolated coaX or 
coaA genes as well as mutant coaX and/or coaA genes. Isolated nucleic acid molecules 
25 that include isolated cooT genes or mutant coaX genes are features in addition to 
isolated CoaX proteins and mutant CoaX proteins. 

The above-described nucleic acid molecules (e.g., genes), proteins, vectors, and 
recombinant microorganisms (e.g., mutant microorganisms), are particularly suited for 
use in methods of producing panto-compounds and/or methods of enhancing panto- 
30 compound production. In one embodiment, the invention features a method for 

producing a panto-compound (e.g., pantothenate) that includes culturing a pantothenate 
kinase mutant (e.g., a recombinant microorganism that misexpresses, e.g., is mutated 
for, pantothenate kinase, as defined herein) under conditions such that panto-compound 
is produced. In another embodiment, the invention features a method for enhancing 
35 production of a panto-compound {e.g. , pantothenate) that includes culturing a 

pantothenate kinase mutant (e.g., a recombinant microorganism that misexpresses, e.g., 
is mutated for, pantothenate kinase, as defined herein) under conditions such that 
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production of the panto-compound is produced. As used herein, the term "enhancing" 
(for example, in the context of the phrase "enhancing production") includes increasing 
the level or rate of production of panto-compound {e.g., pantothenate) as compared to 
the level or rate of production in a non-mutant microorganism (e.g. , a microorganism 
5 having a normal pantothenate kinase genets) and/or having normal pantothenate 
production rates and/or levels. 

Preferably, the level of panto-compound produced in methodologies featuring 
the pantothenate kinase mutants of the present invention is increased by at least 5% as 
compared to the level produced by a non-mutant (e.g., a recombinant microorganism 
10 expressing non-mutated pantothenate kinase). Even more preferably, the level of panto- 
compound is increased 10% as compared to methodologies featuring non-mutants. 
Even more preferably, panto-compound levels (e.g. , pantothenate levels) are increased 
20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, are increased 2-fold, 5-fold, 10-fold, 50- 
fold, 100-fold or more as compared to methodologies featuring non-mutants. 

15 

VI. Additional Mutations Resulting in Enhanced Panto-Compound 
Production 

The methodologies of the present invention further -can include, for example in 
addition to overexpressing or deregulating a pantothenate biosynthetic enzyme and/or an 

20 isoleucine-valine biosynthetic enzyme, or in addition to mutating a pantothenate-kinase 
encoding gene, deleting or mutating an enzyme that catalyzes the conversion of key 
pantothenate biosynthesis substrates or precursors to unwanted or undesirable products. 
For example, mutating the ilvE gene (Kuramitsu et al. (1985) J. Biochem. 97:993-999) 
or a homologue thereof (SEQ ID NO:62 or SEQ ID NO:64), thereby limiting the 

25 conversion of a-ketoisovalerate to valine, in addition to mutating a pantothenate kinase 
encoding enzyme, is predicted to result in even further enhanced or increased production 
of panto-compound. Alternatively, mutating the ansB&ne {Sun and Seflow (199 1 ) J. 
Bacteriol. 173:3831-3845) or a homologue thereof <SEQ ID NO:66), thereby limiting 
the degradation of aspartate, in addition to mutating a pantothenate kinase encoding 

30 enzyme, is predicted to result in even further enhanced or increased production of panto- 
compound. Alternatively, mutating the alsD&ne (Renna et al. ( 1 993) J. Bacteriol. 
175 :3863-3875) or a homologue thereof (SEQ ID NO:68), thereby limiting the 
conversion of acetolactate to acetoin, in addition to mutating a pantothenate kinase 
encoding enzyme, is predicted to result in even further enhanced or increased production 
35 of panto-compound. Alternatively, mutating the avtA gene encoding alanine-valine 
transaminase or a homologue thereof, thereby limiting the conversion of a- 
ketoisovalerate to valine, in addition to mutating a pantothenate kinase encoding 
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enzyme, is predicted to result in even further enhanced or increased production of panto- 
compound. Mutating the avtA gene can include mutating, for example, an avtA gene 
having the nucleotide sequence of SEQ ID NO:70 (e.g., the E. coli avtA gene), or a 
structural homolog thereof (e.g., a homologue encoding a protein having 30-40%, 40- 

5 50%, 50-60%, 60-70%, 70-80%, 80-90%, 90-95% or more identity with the amino acid 
sequence of SEQ ID NO:7 1 ) or a functional homologue (e.g. , a gene encoding a 
structurally unrelated protein having alanine-valine transmainase activity. Such 
mutations can be accomplished using the methodologies as exemplified in the Examples 
(e.g., Examples XIII, XV, XVI and XVII). 

10 Accordingly, in one embodiment, the invention features a method of producing a 

panto-compound which includes culturing a microorganism having a mutant 
pantothenate kinase-encoding gene and which further has a deletion or mutation in an 
avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. In aAother embodiment, the 
invention features a method of producing a panto-compound which includes culturing a 

1 5 microorganism having a mutant pantothenate-kinase encoding gene and a deregulated 
pantothenate biosynthetic pathway enzyme and which further has a deletion or mutation 
in an avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. In another embodiment, 
the invention features a method of producing a panto-compound which includes 
culturing a microorganism having a mutant pantothenate-kinase encoding gene and a 

20 deregulated isoleucine- valine biosynthetic pathway enzyme and which further has a 
deletion or mutation in an avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. 

Mutating the alsD gene can be particularly useful when accomplished in 
conjunction with overexpression or deregulation of the alsS gene, for example, to 
prevent carbon (e.g., acetolactate) from being drawn away from the precursor pool 

25 utilized for a-KI V production. Accordingly, to maximize the contribution of the als 
locus to panto-compound production, it is desirable to disrupt the alsD gene in addition 
to overexpressing the alsS gene. To disrupt the alsD gene, appropriate fragments of the 
als operon, flanking the alsD gene, are amplified by PCR and cloned to provide 
homology for creating the disruptions. A drug resistance gene, such as the cat gene, is 

30 cloned between the flanking DNA fragments in place of the alsD gene, and the 

linearized DNA is transformed into a pantothenate production strain such as PA824, 
selecting for drug-resistance. To overexpress alsS, the alsS coding sequence (e.g., an 
alsS coding sequence that has been engineered by PCR for expression) is cloned into an 
expression vector. Vectors which express alsS (or alternatively, vectors which express 

35 alsS ploy ilvC) are the introduced into panto-compound production strains {e.g. , the 
pantothenate producing strain PA824). 
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The methodologies of the present invention further can include, for example in 
addition to overexpressing or deregulating a pantothenate biosynthetic enzyme and/or an 
isoleucine-valine biosynthetic enzyme, or in addition to mutating a pantothenate-kinase 
encoding gene, deleting or mutating an enzyme that catalyzes the conversion of desired , 
5 panto-compounds to unwanted or undesireable downstream products. 

VII Isolated Nucleic Acid Molecules and Genes 

Another aspect of the present invention features isolated nucleic acid molecules 
that encode Bacillus proteins {e.g. , B. subtilis proteins), for example, Bacillus 
1 0 pantothenate biosynthetic enzymes (e.g. , B. subtilis pantothenate biosynthetic enzymes) 
or Bacillus valine-isoleucine biosynthetic enzymes <e.g., B. subtilis valine-isoleucine 
biosynthetic enzymes). Also featured are isolated coaX and/or coaA nucleic acid 
molecules (eg., isolated coaX and/or coaA genes) as well as isolated nucleic acid 
.molecules that include such coaX and/or coaA nucleic acid molecules or genes. 
1 5 The term "nucleic acid molecule" includes DNA molecules (e.g. , linear, circular, 

cDNA or chromosomal DNA) and RNA molecules^.*., tRNA, rRNA, mRNA) and 
analogs of the DNA or RNA generated using nucleotide analogs. The nucleic acid 
molecule can be single-stranded or double-stranded, but preferably is double-stranded 
DNA. The term "isolated" nucleic acid molecule includes a nucleic acid molecule 
20 which is free of sequences which naturally flank the nucleic acid molecule (/. e. , 

sequences located at the 5' and 3* ends of the nucleic acid molecule) in the -chromosomal 
DNA of the organism from which the nucleic acid is derived. In variousembodiments, 
an isolated nucleic acid molecule can contain less than about 10 kb, 5 kb, 4kb, 3kb, 2kb, 
1 kb, 0.5 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences which naturally flank 
25 the nucleic acid molecule in chromosomal DNA of the microorganism from which the 
nucleic acid molecule is derived. Moreover, an "isolated" nucleic acid molecule, such 
as a cDNA molecule, can be substantially free of other cellular materials when produced 
by recombinant techniques, or substantially free of chemical precursors or other 
chemicals when chemically synthesized. 
30 The term "gene", as used herein, includes a nucleic acid molecule (e.g. , a DNA 

molecule or segment thereof), for example, a protein or RNA-encoding nucleic acid 
molecule, that in an organism, is separated from another gene or other genes, by 
intergenic DNA (i.e., intervening or spacer DNA which naturally flanks the gene and/or 
separates genes in the chromosomal DNA of the organism). A gene may direct 
3 5 synthesis of an enzyme or other protein molecule (e.g. , may comprise coding sequences, 
for example, a contiguous open reading frame <ORF) which encodes a protein) or may 
itself be functional in the organism. A gene in an organism, may be clustered in an 
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operon, as defined herein, said operon being separated from other genes and/or operons 
by the intergenic DNA. Individual genes contained within an operon may overlap 
without intergenic DNA between said individual genes. An "isolated gene", as used 
herein, includes a gene which is essentially free of sequences which naturally flank the 
5 gene in the chromosomal DNA of the organism from which the gene is derived (i.e., is 
free of adjacent coding sequences which encode a second or distinct protein or RNA 
molecule, adjacent structural sequences or the like) and optionally includes 5' and 3' 
regulatory sequences, for example promoter sequences and/or terminator sequences. In 
one embodiment, an isolated gene includes predominantly coding sequences for a 

1 0 protein {e.g., sequences which encode Bacillus proteins). In another embodiment, an 
isolated gene includes coding sequences for a protein {e.g., for a Bacillus protein) and 
adjacent 5' and/or 3' regulatory sequences from the chromosomal DNA of the organism 
from which the gene is derived {e.g., adjacent 5' and/or 3' Bacillus regulatory 
sequences). Preferably, an isolated gene contains less than about 10 kb, 5 kb, 2 kb, 1 kb, 

15 0.5 kb, 0.2 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences which naturally 
flank the gene in the chromosomal DNA of the organism from which the gene is 
derived. 

In one aspect, the present invention features isolated panB nucleic acid 
sequences or genes, isolated panC nucleic acid sequences or genes, isolated panD 

20 nucleic acid sequences or genes, isolated panE nucleic acid sequences or genes, isolated 
ilvB, ilvN, ilvBN nucleic acid sequences or genes, isolated alsS nucleic acid sequences or 
genes, isolated ilvC nucleic acid sequences or genes and/or isolated ilvD nucleic acid 
sequences or genes. 

In a preferred embodiment, the nucleic acid or gene is derived from Bacillus 

25 {e.g., is 2taci//us-derived). The term "derived from Bacillus" or "flac/V/us-derived" 

includes a nucleic acid or gene which is naturally found in microorganisms of the genus 
Bacillus. Preferably, the nucleic acid or gene is derived from a microorganism selected 
from the group consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, 
Bacillus firmus, Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, 

30 Bacillus circulans, Bacillus coagulans, Bacillus licheniformis, Bacillus megaterium, 
Bacillus pumilus, Bacillus thuringiensis, and other Group 1 Bacillus species, for 
example, as characterized by 16S rRNA type (Priest, supra). In another preferred 
embodiment, the nucleic acid or gene is derived from Bacillus brevis or Bacillus 
stearothermophilus. In another preferred embodiment, the nucleic acid molecules 

35 and/or genes of the present invention are derived from a microorganism selected from 
the group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens, Bacillus 
halodurans, Bacillus subtilis, and Bacillus pumilus. In a particularly preferred 
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embodiment, the nucleic acid or gene is derived from Bacillus subtilis (e.g., is Bacillus 
5ufe//7/5-derived). The lerm "derived from Bacillus subtilis" or "Bacillus subtilis- 
derived" includes a nucleic acid or gene which is naturally found in BaciUus subtilis. In 
yet another preferred embodiment, the nucleic acid or gene is a Bacillus gene 
5 homologue (e.g. , is derived from a species distinct from Bacillus but having significant 
homology to a Bacillus gene of the present invention, for example, a Bacillus pan gent 
or Bacillus ilv gene). 

Included within the scope of the present invention are bacterial-derived nucleic 
acid molecules or genes and/or £ac///itf-derived nucleic acid molecules or genes <<?.g., 
10 B. subtilis-derived nucleic acid molecules or genes), for example, the genes identified by 
the present inventors, for example, Bacillus or B. subtilis cooT genes, coaA genes, pan 
genes and/or ilv genes. Further included within the scope of the present invention are 
bacterial-derived nucleic acid molecules or genes and/or BacHlus-dzrived nucleic acid 
molecules or genes (e.g., B. suftrifo-derived nucleic acid molecules or genes) (e.g., B. 
1 5 subtilis nucleic acid molecules or genes) which differ from naturally-occurring bacterial 
and/or Bacillus nucleic acid molecules or genes (e.g., B. subtilis nucleic acid molecules 
or genes), for example, nucleic acid molecules or genes which have nucleic acids that 
are substituted, inserted or deleted, but which encode proteins substantially similar to the 
naturally-occurring gene products of the present invention. In one embodiment, an 
20 isolated nucleic acid molecule comprises at least one of the nucleotide sequences set 
forth as SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 
31, SEQ ID NO:33, SEQ ID NO:86, SEQ ID NO:35 or SEQ ID NO:37. In another 
preferred embodiment, an isolated nucleic acid molecule comprises at least two, three or 
four of the nucleotide sequences set forth as SEQ ID NO:23, SEQ ID NO:25, SEQ ID 
25 NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ ID NO:88, SEQ ID 
NO:35orSEQlDNO:37. For example, a preferred isolated nucleic acid molecule of 
the present invention can include the nucleotide sequences of SEQ IDNO:23,SEQ ID 
NO:25 and SEQ ID NO:27, preferably linked such that the proteins encoded by the 
nucleotide sequences of SEQ ID NO:23, SEQ ID NO:25 and SEQ ID NO:27 are each 
30 produced when the isolated nucleic acid molecule is expressed in a microorganism g., 
SEQ ID NO:59). In another example, a preferred isolated nucleic acid molecule of the 
present invention can include the nucleotide sequences of SEQ ID NO:31 and SEQ ID 
NO:33, preferably linked such that the proteins encoded by the nucleotide sequences of 
SEQ ID NO:3 1 and SEQ ID NO:33 are each produced when the isolated nucleic acid 
35 molecule is expressed in a microorganism (e.g. , nucleotides 1 -2246 of SEQ ID NO:58). 
In another example, a preferred isolated nucleic acid molecule of the present invention 
can include the nucleotide sequence of SEQ IDMO:86. in another example, a preferred 
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isolated nucleic acid molecule of the present invention can include the nucleotide t 
sequences of SEQ ID NO:31, SEQ ID NO:33 and SEQ ID NO:35, preferably linked 
such that the proteins encoded by the nucleotide sequences of SEQ IDNO:31, SEQ ID 
NO:33 and SEQ ID NO:35 are each produced when the isolated nucleic acid molecule is 
5 expressed in a microorganism (e.g., SEQ ID NO:58). 

In another embodiment, an isolated nucleic acid molecule, of the present 
invention comprises a nucleotide sequence which is at least about 60-65%, preferably at 
least about 70-75%, more preferable at least about 80-85%, and even more preferably at 
least about 90-95% or more identical to a nucleotide sequence set forth as SEQ ID 
10 NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID 

NO:33, SEQ ID NO:88, SEQ ID NO:35 or SEQ ID NO:37. In another embodiment, an 
isolated nucleic acid molecule hybridizes under stringent conditions to a nucleic acid 
molecule having a nucleotide sequence set forth as SEQ ID N0123, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ ID NO:88, SEQ 
1 5 ID NO:35 or SEQ ID NO:37. Such stringent conditions are known to those skilled in 
the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent (e.g. high 
stringency) hybridization conditions are hybridization in 6X sodium chloride/sodium 
citrate (SSC) at about 45°C, followed by one or more washes in 0.2 X SSC, 0.1% SDS 
20 at 50-65°C. Preferably, an isolated nucleic acid molecule of the invention that 
hybridizes under stringent conditions to the sequence of SEQ ID NO:23, SEQ ID 
NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ ID 
NO:88, SEQ ID NO:35 or SEQ ID NO:37 corresponds to a naturally-occurring nucleic 
acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to 
25 an RNA or DNA molecule having a nucleotide sequence that occurs in nature. 

A nucleic acid molecule of the present invention (e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ ID NO:88, SEQ ID NO:35 or 
SEQ ID NO:37 can be isolated using standard molecular biology techniques and the 
30 sequence information provided herein. For example, nucleic acid molecules can be 
isolated using standard hybridization and cloning techniques (e.g., as described in 
Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 
2nd, ed., Cold Spring Harbor Laboratory Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) or can be isolated by the polymerase chain reaction using 
35 synthetic oligonucleotide primers designed based upon the sequence of SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ 
ID NO:88, SEQ ID NO:35 or SEQ ID NO:37. A nucleic acid of the invention can be 
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amplified using cDNA, mRNA or alternatively, genomic DNA, as a template and 
appropriate oligonucleotide primers according to standard PCR amplification 
techniques. In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
5 sequence shown in SEQ ID NO:23, SEQ ID NO:2S, SEQ ID NO:27, SEQ ID NO:29, 
SEQ ID NO:33, SEQ ID NO 31, SEQ ID NO:33, SEQ ID NO:88, SEQ IDNO:35. 

Additional panC nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:25, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:26<<? g., encode a polypeptide having 
1 0 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:26 and a substantially 
identical activity as said polypeptide), hybridize under •stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:25 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
1 5 amino acid sequence of SEQ ID NO:26, or are complementary to apanC nucleotide 
sequence as set forth herein. 

Aditional panD nucleic acid sequences include those that -comprise the 
nucleotide sequence of SEQ ID NO:27, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:28 (e.g., encode a polypeptide having 
20 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:28 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:27 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
25 amino acid sequence of SEQ ID NO:28, or are complementary to apanD nucleotide 
sequence as set forth herein. 

Additional panE nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID N0.29, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:30 {e g. , encode a polypeptide having 
30 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:30 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID*IO:29 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
35 amino acid sequence of SEQ ID NO:30, or are complementary to a panE nucleotide 
sequence asset forth herein. 
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Additional ilvB nucleic acid sequences are those that comprise the nucleotide 
sequence of SEQ ID NO:3 1 , encode a homologue of the pplypeptide having the amino 
acid sequence set forth in SEQ ID NO:32 (e.g., encode a polypeptide having at least 
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or. more identity to the polypeptide having 
5 the amino acid sequence as set forth in SEQ ID NO:32 and a substantially identical 
activity as said polypeptide), hybridize under stringent conditions to all or a portion of a 
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:31 or to all or a 
portion of a nucleic acid molecule that encodes a polypeptide having the amino acid 
sequence of SEQ ID NO:32, or are complementary to an ilvB nucleotide sequence as set 
10 forth herein. 

Additional ilvN nucleic acid sequences are those that comprise the nucleotide 
sequence of SEQ ID NO:33, encode a homologue of the polypeptide having the amino 
acid sequence set forth in SEQ ID NO:34 (e.g., encode a polypeptide having at least 
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide having 

1 5 , the amino acid sequence as set forth in SEQ ID NO:34 and a substantially identical 

activity as said polypeptide), hybridize under stringent conditions to all or a portion of a 
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:33 or to all or a 
portion of a nucleic acid molecule that encodes a polypeptide having the amino acid 
sequence of SEQ ID NO:34, or are complementary to an ilvN nucleotide sequence as set 

20 forth herein. 

Additional ilvC nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:35, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:36 {e.g., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 

25 having the amino acid sequence as set forth in SEQ ID NO:36 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:35 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:36, or are complementary to an ilvC nucleotide 

30 sequence as set forth herein. 

Additional ilvD nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:37, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:38 (e.g., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 

35 having the amino acid sequence ais set forth in SEQ ID NO:38 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or 
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to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:38, or are complementary to an ilvD nucleotide 
sequence as set forth herein. , 
Additional alsS nucleic acid sequences include those mat comprise the 
5 nucleotide sequence of SEQ ID NO:86, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:87 (eg., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:87 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
1 0 portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:86 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:87, or are complementary td an alsS nucleotide 
sequence as set forth herein. 

In another embodiment, an isolated nucleic acid molecule is or includes a coaX 
1 5 gene, or portion or fragment thereof. In one embodiment, an isolated coaX nucleic acid 
molecule or gene comprises the nucleotide sequence as set forth in SEQ IDNO:19<e,g, 
comprises the B. subtilis coca nucleotide sequence). In another embodiment, an 
isolated coaX nucleic acid molecule or «ene comprises a nucleotide sequence that 
encodes the amino acid sequence as set forth in "SEQ ID NO:9<e,g, encodes the B. 
20 subtilis CoaX amino acid sequence). In yet anothercmbodiment, an isolated coaX 
nucleic acid molecule or gene encodes a homologue of the CoaX protein having the 
amino acid sequence of SEQ ID NO:9. As used herein, the term "homologue" includes 
a protein or polypeptide sharing at least about 30-35%, preferably at least about 35-40%, 
more preferably at least about 40-50%, and even more preferably at least about 60%, 
25 70%, 80%, 90% or more identity with the amino acid sequence of a wild-type protein or 
polypeptide described herein and having a substantially equivalent functional or 
biological activity as said wild-type protein or polypeptide. For example, a CoaX 
homologue shares at least about 30-35%, preferably at least about 35-40%, more 
preferably at least about 40-50%, andevert more preferably at least about 60%, 70%, 
30 80%, 90% or more identity with the protein having the amino acid sequence set forth as 
SEQ ID NO:9 and has a substantially equivalent functional or biological activity (i.e., is 
a functional equivalent) of the protein having the amino acid sequence set forth as SEQ 
ID NO:9 (e.g., has a substantially equivalent pantothenate kinase activity). In a 
preferred embodiment, an isolated coaX nucleic acid molecule or gene comprises a 
35 nucleotide sequence that encodes a polypeptide as set forth in any one of SEQ ID NO:7, 
SEQ IDNO:8, SEQ ID NO: 10, -SEQ ID NO: 1 1, SEQ 1DN0:12, SEQ ID NO: 13, SEQ 
ID NO: 1 4, SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, SEQ ID 
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NO:74 or SEQ ID NO:75. In another embodiment, an isolated coaX nucleic acid 
molecule hybridizes to all or a portion of a nucleic acid molecule having the nucleotide 
sequence set forth in SEQ ID NO:19 or hybridizes to all or a portion of a nucleic acid 
molecule having a nucleotide sequence that encodes a polypeptide having the amino 
5 acid sequence of any of SEQ ID NOs:7-18, 74 or 75. Such hybridization conditions are 
known to those skilled in the art and can be found in Current Protocols in Molecular 
Biology, Ausubel et al, eds., John Wiley & Sons, Inc. (1995), sections 2, 4 and 6. 
Additional stringent conditions can be found in Molecular Cloning: A Laboratory 
Manual, Sambrook et al, Cold Spring Harbor Press, Cold Spring Harbor, NY (1 989), 

1 0 chapters 7, 9 and 1 1 . A preferred, non-limiting example of stringent hybridization 

conditions includes hybridization in 4X sodium chloride/sodium citrate (SSC), at about 
65-70°C (or hybridization in 4X SSC plus 50% formamide at about 42-50°C) followed 
by one or more washes in IX SSC, at about 65-70°C. A preferred, non-limiting 
example of highly stringent hybridization conditions includes hybridization in IX SSC, 

1 5 , at about 65-70°C (or hybridization in IX SSC plus 50% formamide at about 42-50°C) 
followed by one or more washes in 0.3X SSC, at about 65-70°C. A preferred, non- 
limiting example of reduced stringency hybridization conditions includes hybridization 
in 4X SSC, at about 50-60°C (or alternatively hybridization in 6X SSC plus 50% 
formamide at about 40-45°C) followed by one or more washes in 2X SSC, at about 50- 

20 60°C. Ranges intermediate to the above-recited values, e.g. , at 65-70°C or at 42-50°C 
are also intended to be encompassed by the present invention. SSPE (IX SSPE is 0.15 
M NaCl, lOmM NaH 2 P0 4 , and 1 .25 mM EDTA, pH 7.4) can be substituted for SSC (IX 
SSC is 0.1 5 M NaCl and 15 mM sodium citrate) in the hybridization and wash buffers; 
washes are performed for 1 5 minutes each after hybridization is complete. The 

25 hybridization temperature for hybrids anticipated to be less than 50 base pairs in length 
should be 5-10°C less than the melting temperature (T m ) of the hybrid, where T m is 
determined according to the following equations. For hybrids less than 18 base pairs in 
length, T m (°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and 
49 base pairs in length, T m (°C) = 81.5 + l^logiofNa*]) + 0.41(%G+C) - (600/N), 

30 where N is the number of bases in the hybrid, and [Na + ] is the concentration of sodium 
ions in the hybridization buffer ([Na + ] for IX SSC = 0. 1 65 M). It will also be 
recognized by the skilled practitioner that additional reagents may be added to 
hybridization and/or wash buffers to decrease non-specific hybridization of nucleic acid 
molecules to membranes, for example, nitrocellulose or nylon membranes, including but 

35 not limited to blocking agents {e.g. , BSA or salmon or herring sperm carrier DNA), 
detergents (e.g., SDS), chelating agents (e.g., EDTA), Ficoll, PVP and the like. When 
using nylon membranes, in particular, an additional preferred, non-limiting example of 
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striivgent hybridization conditions is hybridization in 0.25-0.5M NaHaPO-,, 7% SDS at 
about 65°C, followed by one or more washes at 0.02M NaH 2 P0 4 , 1% SDS at-65°C, see 
e.g., Church and Gilbert (1984) Proc. Natl. Acad. Sci. USA 81 : 1991-1995, (or, 
alternatively, 0.2X SSC, 1% SDS). In another preferred embodiment, an isolated 
5 nucleic acid molecule comprises a nucleotide sequence that is complementary to ^oaX 
nucleotide sequence as set forth herein {e.g., is the full complement of the nucleotide 
sequence set forth as SEQ ID NO: 19). 

In another preferred embodiment, an isolated nucleic acid molecule is or 
includes a coaA gene, for example, a Bacillus {e.g., B. subtilis)-coaA gene, or portion or 
1 0 fragment thereof. Exemplary isolated coaA nucleic acid molecules and/or genes include 
(1) an isolated coaA nucleic acid molecule or gene comprising the nucleotide sequence 
as set forth in any one of SEQ ID NOs:20-22; (2) an isolated coaA nucleic acid molecule 
or gene comprising a nucleotide sequence that encodes the amino acid sequence as set 
forth in SEQ ID NO:3; (3) an isolated coaA nucleic acid molecule or gene comprising a 
1 5 nucleotide sequence which encodes a CoaA homologue (e.g. , a polypeptide having an 
amino acid sequence at least about 30-35%, preferably at least about 35-40%, more 
preferably at least about 40-50%; and even more preferably at least about 60%, 70%, 
80%, 90% or more identical to the amino acid sequence set "forth as SEQ IDNO:3 and 
having a substantially equivalent enzymatic activity; (4) an isolated coaA nucleic acid 
20 molecule or gene comprising a nucleotide sequence that encodes a polypeptide as set 
forth in any one of SEQ ID NO:l, SEQ lD NO:2,SEQ IDNO:4,SEQ lDNO:5 or SEQ 
ID NO:6; (5) an isolated nucleic acid molecule that hybridizes under stringent 
conditions to all or a portion of a nucleic acid molecule having the nucleotide sequence 
set forth in SEQ ID NO:20, SEQ ID NO:21 or SEQ ID NO:22 or hybridizes to all or a 
25 portion of a nucleic acid molecule having a nucleotide sequence that encodes a 
polypeptide having the amino acid sequence of SEQ ID NO:3; and (6) an isolated 
nucleic acid molecule comprising a nucleotide sequence that is complementary to a 
coaA nucleotide sequence as set forth herein (e.g., is the full complement of the 
nucleotide sequence set forth in SEQ ID NO:20, SEQ ID NO:21 or SEQ ID NO:22). 
30 A nucleic acid molecule of the present invention {e.g. , a coaX nucleic acid 

molecule or gene or a coaA nucleic acid molecule or .gene), can be isolated using 
standard molecular biology techniques and the sequence information provided herein. 
For example, nucleic acid molecules can be isolated using standard hybridization and 
cloning techniques (e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, T. 
35 Molecular Cloning: A Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, MY, 1989) or can be isolated 
by the polymerase chain reaction using synthetic oligonucleotide primers designed 
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based upon the coaXox coaA nucleotide sequences set forth herein, or flanking 
sequences thereof. A nucleic acid of the invention (e.g., a coaX nucleic acid molecule or 
gene or a coaA nucleic acid molecule or gene), can be amplified using cDNA, mRNA or 
alternatively, chromosomal DNA, as a template and appropriate oligonucleotide primers 
5 according to standard PCR amplification techniques. 

Yet another embodiment of the present invention features mutant coaX and coaA 
nucleic acid molecules or genes. The phrase "mutant nucleic acid molecule" or "mutant 
gene" as used herein, includes a nucleic acid molecule or gene having a nucleotide 
sequence which includes at least one alteration (e.g., substitution, insertion, deletion) 

1 0 such that the polypeptide or protein that may be encoded by said mutant exhibits an , 
activity that differs from the polypeptide or protein encoded by the wild-type nucleic 
acid molecule or gene. Preferably, a mutant nucleic acid molecule or mutant gene (e.g., 
a mutant coaA or cooYgene) encodes a polypeptide or protein having a reduced activity 
(e.g., having a reduced pantothenate kinase activity) as compared to the polypeptide or 

1 5 protein encoded by the wild-type nucleic acid molecule or gene, for example, when 
assayed under similar conditions (e.g., assayed in microorganisms cultured at the same 
temperature). A mutant gene also can encode no polypeptide or have a reduced level of 
production of the wild-type polypeptide. 

As used herein, a "reduced activity" or "reduced enzymatic activity" is one that 

20 is at least 5% less than that of the polypeptide or protein encoded by the wild-type 
nucleic acid molecule or gene, preferably at least 5-10% less, more preferably at least 
10-25% less and even more preferably at least 25-50%, 50-75% or 75-100% less than 
that of the polypeptide or protein encoded by the wild-type nucleic acid molecule or 
gene. Ranges intermediate to the above-recited values, e.g., 75-85%, 85-90%, 90-95%, 

25 are also intended to be encompassed by the present invention. As used herein, a 
"reduced activity" or "reduced enzymatic activity" also includes an activity that has 
been deleted or "knocked out" (e.g., approximately 100% less activity than that of the 
polypeptide or protein encoded by the wild-type nucleic acid molecule or gene). 
Activity can be determined according to any well accepted assay for measuring activity 

30 of a particular protein of interest. Activity can be measured or assayed directly, for 
example, measuring an activity of a protein isolated or purified from a cell. 
Alternatively, an activity can be measured or assayed within a cell or in an extracellular 
medium. For example, assaying for a mutant coaA gene or a mutant coaA' gene (i.e., 
said mutant encoding a reduced pantothenate kinase activity) can be accomplished by 

35 expressing the mutated gene in a microorganism, for example, a mutant microorganism 
which expresses pantothenate kinase in a temperature-sensitive manner, assaying the 
mutant gene for the ability to complement a temperature sensitive (Ts) mutant for 
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pantothenate kinase activity. A coaX mutant gene or coaA mutant, gene that encodes a 
"reduced pantothenate kinase activity" is one that complements the Ts. mutant less 
effectively than, for example, a corresponding wild-type coaAgene or coaA -gene. 

It will be appreciated by the skilled artisan that even a single substitution in a 
5 nucleic acid or gene sequence (e.g., a base substitution that encodes an amino acid 

change in the corresponding amino acid sequence) can dramatically affect the activity of 
an encoded polypeptide or protein as compared to the corresponding wild-type 
polypeptide or protein. A mutant nucleic acid or mutant gene (e.g., encoding a mutant 
polypeptide or protein), as defined herein, is readily distinguishable from a nucleic acid 
10 or gene encoding a protein homologue, as described above, in that a mutant nucleic acid 
or mutant gene encodes a protein or polypeptide having an altered activity, optionally 
observable as a different or distinct phenotype in a microorganism expressing said 
mutant gene or nucleic acid or producing said mutant protein or polypeptide (i.e., a 
mutant microorganism) as compared to a corresponding microorganism expressing the 
15 wild-type gene or nucleic acid or producing said mutant protein or polypeptide. By 
contrast, a protein homologue has an identical or substantially similar activity, 
optionally phenotypically indiscernable when produced in a microorganism, as 
compared to a corresponding microorganism expressing the wild-type gene or nucleic 
acid. Accordingly it is not, for example, the degree of sequence identity between 
20 nucleic acid molecules, genes, protein or polypeptides that serves to distinguish between 
homologies and mutants, rather it is the activity of the encoded protein or polypeptide 
that distinguishes between homologues and mutants: homologues having, for example, 
low (e.g., 30-50% sequence identity) sequence identity yet having substantially 
equivalent functional activities, and mutants, for example sharing 99% sequence identity 
25 yet having dramatically different or altered functional activities. Exemplary 

homologues are set forth in Figure 20 (/.e.,X3oaA homologues) and in Figure 23 (i.e., 
CoaX homologues). Exemplary mutants are described in Examples XV and XVIII 
herein. 

30 VIII. Recombinant Nucleic Acid Molecules and Vectors 

The present invention further features recombinant nucleic acid molecules <e,g., 
recombinant DNA molecules) that include nucleic acid molecules and/or ^enes 
described herein (e.g., isolated nucleic acid molecules and/or genes), preferably Bacillus 
genes, more preferably Bacillus subtilis genes, even more preferably Bacillus subtilis 

35 pantothenate kinase genes (e.g. , coaX genes or coaA genes), pantothenate biosynthetic 
genes te g., genes encoding pantothenate biosynthetic enzymes, for example, panB 
genes encoding ketopantoate hydroxymethyltransfera5e,/7an£^enes^ncoding 
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ketopantoate reductase, panC genes encoding pantothenate synthetase, and/or panD 
genes encoding aspartate-a-decarboxylase) and/or isoleucine-valine (//v) biosynthetic 
genes (e.g. ,ilvBN or alsS genes encoding acetohydroxyacid synthetase, ilvC genes 
encoding acetohydroxyacid isomeroreductase and/or ilvD genes encoding dihydroxyacid 

i 

5 dehydratase). 

The present invention further features vectors (e.g. , recombinant vectors) that 
include nucleic acid molecules (e.g., isolated or recombinant nucleic acid molecules 
and/or genes) described herein. In particular, recombinant vectors are featured that 
include nucleic acid sequences that encode bacterial gene products as described herein, 

1 0 preferably Bacillus gene products, more preferably Bacillus subtilis gene products, even 
more preferably Bacillus subtilis pantothenate biosynthetic gene products (e.g., 
pantothenate biosynthetic enzymes, for example, ketopantoate 
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase, and/or 
aspartate-a-decarboxylase) and/or isoleucine-valine biosynthetic gene products (e.g., 

15 , acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase and/or dihydroxyacid 
dehydratase). 

The term "recombinant nucleic acid molecule" includes a nucleic acid molecule 
(e.g., a DNA molecule) that has been altered, modified or engineered such that it differs 
in nucleotide sequence from the native or natural nucleic acid molecule from which the 
20 recombinant nucleic acid molecule was derived (e.g., by addition, deletion or 
substitution of one or more nucleotides). Preferably, a recombinant nucleic acid 
molecule (e.g., a recombinant DNA molecule) includes an isolated nucleic acid 
molecule or gene of the present invention (e.g., an isolated coaX, coaA, pan or ilv gene) 
operably linked to regulatory sequences. 

25 The term "recombinant vector" includes a vector (e.g., plasmid, phage, phasmid, 

virus, cosmid or other purified nucleic acid vector) that has been altered, modified or 
engineered such that it contains greater, fewer or different nucleic acid sequences than 
those included in the native or natural nucleic acid molecule from which the 
recombinant vector was derived. Preferably, the recombinant vector includes a coaX, 

30 coaA, pan or ilv gene or recombinant nucleic acid molecule including such coaX, coaA t 
pan or ilv gene, operably linked to regulatory sequences, for example, promoter 
sequences, terminator sequences and/or artificial ribosome binding sites (RBSs), as 
defined herein. 

The phrase "operably linked to regulatory sequence(s)" means that the 
35 nucleotide sequence of the nucleic acid molecule or gene of interest is linked to the 
regulatory sequence(s) in a manner which allows for expression (e.g., enhanced, 
increased, constitutive, basal, attenuated, decreased or repressed expression) of the 
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nucleotide sequence, preferably expression of a gene product encoded by the nucleotide 
sequence {e.g., when the recombinant nucleic acid molecule is included in a 
recombinant vector, as defined herein, and is introduced into a microorganism). 

The term "regulatory sequence" includes nucleic acid sequences which affect 
5 (e.g., modulate or regulate) expression of other nucleic acid sequences. In one 

embodiment, a regulatory sequence is included in a recombinant nucleic acid molecule 
or recombinant vector in a similar or identical position and/or orientation relative to a 
particular gene of interest as is observed for the regulatory sequence and gene of interest 
as it appears in nature, e.g., in a native position and/or orientation. For example, a gene 
1 0 of interest can be included in a recombinant nucleic acid molecule or recombinant vector 
operably linked to a regulatory sequence which accompanies or is adjacent to the^ene 
of interest in the natural organism (e.g., operably linked to "native" regulatory 
sequences, for example, to the "native" promoter). Alternatively, agene of interest ean 
be included in a recombinant nucleic acid molecule or recombinant vector operably 
1 5 linked to a regulatory sequence which accompanies or is adjacent to another {e.g. , a 

different) gene in the natural organism. Alternatively, a^ene of interest t:an be included 
in a recombinant nucleic acid molecule or recombinant vector operably linked to a 
regulatory sequence from another organism. For example, regulatory sequences from 
other microbes (e.g., other bacterial regulatory sequences, bacteriophage regulatory 
20 sequences and the like) can be operably linked to a particular gene of interest. 

In one embodiment, a regulatory sequence is a non-native or non-naturally- 
occurring sequence (e.g., a sequence which has been modified, mutated, substituted, 
derivatized, deleted including sequences which are chemically synthesized). Preferred 
regulatory sequences include promoters, enhancers, termination signals, anti-termination 
25 signals and other expression control elements {e g. , sequences to which repressors or 
inducers bind and/or binding sites for transcriptional and/or translational regulatory 
proteins, for example, in the transcribed mRN A). Such regulatory sequences are 
described, for example, in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular 
Cloning: A Laboratory Manual. 2nd, ed, Cold Spring Harbor laboratory, Cold Spring 
30 Harbor Laboratory Press, Cold Spring Harbor, NY, 1 9S9. Regulatory sequences include 
those which direct constitutive expression of a nucleotide sequence in a microorganism 
(e.g:, constitutive promoters and strong constitutive promoters), those which direct 
inducible expression of a nucleotide sequence in a microorganism (e.g. , inducible 
promoters, for example, xylose inducible promoters) and those which attenuate or 
35 repress expression of a nucleotide sequence in a microorganism {e.g. , attenuation signals 
or repressor sequences). It is also within the scope of the present invention to regulate 
expression of a gene of interest by removing or deleting regulatory sequences. For 
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example, sequences involved in the negative regulation of transcription can be removed 
such that expression of a gene of interest is enhanced. 

In one embodiment, a recombinant nucleic acid molecule or recombinant vector 
of the present invention includes a nucleic acid sequence or gene that encodes at least 
5 one bacterial gene product (e.g., a pantothenate biosynthetic enzyme, an isoleucine- 
valine biosynthetic enzyme, or a CoaA biosynthetic enzyme, for example CoaA or 
CoaX) operably linked to a promoter or promoter sequence. Preferred promoters of the 
present invention include Bacillus promoters and/or bacteriophage promoters {e.g., 
bacteriophage which infect Bacillus). In one embodiment, a promoter is a Bacillus 
1 0 promoter, preferably a strong Bacillus promoter (e.g. , a promoter associated with a 
biochemical housekeeping gene in Bacillus or a promoter associated with a glycolytic 
pathway gene in Bacillus). In another embodiment, a promoter is a bacteriophage 
promoter. In a preferred embodiment, the promoter is from the bacteriophage SP01 . In 
a particularly preferred embodiment, a promoter is selected from the group consisting of 
1 5 P i5 , P 2 6 or Pveg, for example, the promoters set forth in SEQ ID NO:39, SEQ ID NO:40 
or SEQ ID NO:41 . Additional preferred promoters include te/{the translational 
elongation factor (TEF) promoter) and pyc (the pyruvate carboxylase (PYC) promoter), 
which promote high level expression in Bacillus (e.g., Bacillus subtilis). Additional 
preferred promoters, for example, for use in Gram positive microorganisms include, but 
20 are not limited to, the amyE promoter or phage SP02 promoters. Additional preferred 
promoters, for example, for use in Gram negative microorganisms include, but are not 
limited to tac, trp, tet, trp-tet, lpp t lac, lpp-lac> laclq, T7, T5, T3, gal, trc, ara, SP6, X-P R 
orX-P L . 

In another embodiment, a recombinant nucleic acid molecule or recombinant 
25 vector of the present invention includes a terminator sequence or terminator sequences 
(e.g. , transcription terminator sequences). The term "terminator sequences" includes 
regulatory sequences which serve to terminate transcription of a gene. Terminator 
sequences (or tandem transcription terminators) can further serve to stabilize mRNA 
(e.g., by adding structure to mRNA), for example, against nucleases. 
30 In yet another embodiment, a recombinant nucleic acid molecule or recombinant 

vector of the present invention includes sequences which allow for detection of the 
vector containing said sequences (i.e., detectable and/or selectable markers), for 
example, sequences that overcome auxotrophic mutations, for example, ura3 or //v£, 
fluorescent markers, and/or colorimetric markers (e.g., /acZ/0-galactosidase), and/or 
35 antibiotic resistance genes (e.g. , amp or tet). 
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In yet another embodiment, a recombinant nucleic acid molecule or recombinant 
vector of the present invention includes an artificial ribosome binding site (RBS). The 
term "artificial ribosome binding site (RBS)" includes a site within an mRNA molecule 
(e.g., coded within DNA) to which a ribosome binds <e.g., to initiate translation) which 
differs from a native RBS (e.g., a RBS found in a naturally-occurring gene) by at least 
one nucleotide. Preferred artificial RBSs include about 5-6, 7-8, 9-10, 1 1-12, 13-14, 15- 
16, 17-18, 19-20, 21-22, 23-24, 25-26, 27-28, 29-30 or more nucleotides of which about 
1-2, 3-4, 5-6, 7-8, 9-10, 1 1-12, 13-15 or more differ from the native RBS (e.g., the 
native RBS of a gene of interest). Preferably, nucleotides which differ are substituted, 
such that they are identical to one or more nucleotides of an ideal RBS (e.g., SEQ ID 
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48), when 
optimally aligned for comparisons. Artificial RBSs can be used to replace the naturally- 
occurring or native RBS associated with a particular gene. Artificial RBSs preferably 
increase translation of a particular gene. Preferred artificial RBSs (e.g., RBSs for 
increasing the translation of panB, for example, of B. subtilis panB) are depicted in 
Table I A (e.g. , SEQ ID NO:49 and SEQ ID MO:50). 



20 



Table I A: Preferred panB Ribosome Binding Sites 
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AGAAAGGAGGTGA 

CCCTCT - AG - AAGGAGGAGAAAACATG 
CCCTCT- AG- - AGGAGGAGAAAAC ATG 
25 TAAACAT -G- - AGGAGGAGAAAACATG 



ideal RBS (SEQ ID NO: 44) 

RBS1 (SEQ ID NO:49) 

RBS2 <SEQ ID NO:S0) 

panB native RBS (SEQ ID NO: 42) 



30 



Additional preferred artificial RBSs (e.g., RBSs for increasing the translation of 
panD, for example, of B. subtilis panD) are depicted in Table IB (e.g., SEQ ID NO:51, 
SEQ ID NO:52, SEQ ID NO:53 and SEQ ID NOS4). 

Table IB: Preferred panD Ribosome Binding Sites 
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CTAGAAA AGGAGGA ATTTAAATG 
TTAAGAA AGGAGGT GANNNNATG 

TTAGAA AGGAGGA TTTAAATATG 
TTAGAA AGGAGG TTTAATTAATG 
TTAGAAAGGAGGTGATTTAAATG 
TTAGAAAGGAGGTGTTTAAAATG 
TTAGAAAGGAGGTGANNNNNATG 



pAN423 RBS -(SEQ ID NO -.88) 
ideal RBS <SEQ ID NO:4S) 

new design A (SEQ ID NO: SI) 
new design B (SEQ ID NO:S2) 
new design CI (SEQ ID NO: S3) 
new design C2 (SEQ ID NO:S4) 
ideal RBS (SEQ ID NO: 46) 
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Additional preferred artificial RBSs<e.g., RBSs for increasing the translation of 
panD, for example, of B. subtilis panD) are depicted in Table 1C (e.g. , SEQ ID NO:55, 
SEQIDNO:56andSEQIDNO:57). The predicted amino acid sequence at the C- 
5 terminus of the PanC protein is shown. The start codon for PanD translation is 
underlined. 



10 



15 
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Table 1C: Additional Preferred panD Ribosome Binding Sites 

10 20 



--A GAA AGG AGG TGA NNN NNN N ATG ideal RBS (SEQ ID NO:47) 

ATT CGA GAA ATG GAG AGA ATA TAA T ATG native panD RBS (SEQ ID NO: 43) 

lie Arg Glu Met Glu Arg lie * Met SEQ ID NO: 89 

♦ 

--A GAA AGG AGG TGA NNN NNN N ATG ideal RBS (SEQ ID NO:47) 

ATT CGA GAA AGG AGG TGA ATA TAA T ATG NDI <SEQ ID NO: 55) 

lie Arg Glu Arg Arg * Met SEQ ID NO: 90 

ATT CGA GAA AGG AGG TGA ATA ATA - ATG NDI I (SEQ ID NO: 56) 

lie Arg Glu Arg Arg * Met SEQ ID NO: 90 

ATT CGT AGA AAG GAG GTG AAT TAA T ATG NDIII (SEQ ID NO: 57) 

25 lie Arg Arg Lys Glu Val Asn * Met SEQ ID NO: 91 

AGA AAG GAG GTG ANN NNN N ATG ideal RBS (SEQ ID NO: 48) 



30 Accordingly, in one embodiment, a vector of the present invention includes an 

artificial RBS as set forth in SEQ ID NO:49 or SEQ ID NO:50. In another embodiment, 
a vector of the present invention includes an artificial RBS as set forth in SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53 or SEQ ID NO:54. In yet another embodiment, 
a vector of the present invention includes an artificial RBS as set forth in SEQ ID 

35 NO:55, SEQ ID NO:56 or SEQ ID NO:57. 

In another embodiment, a recombinant vector of the present invention includes 
sequences that enhance replication in bacteria (e.g., replication-enhancing sequences). 
In one embodiment, replication-enhancing sequences are derived from E. coli. In 
another embodiment, replication-enhancing sequences are derived from pBR322 (e.g., 

40 sequences included within the pBR322 derived portion of any of the pAN vectors as set 
forth in the Figures , i.e., the Not I-Not I sequences from about 5.0 kB to 9.0 kB of the 
vector depicted in Figure 3A). 
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In yet another embodiment, a recombinant vector of .the present invention 
includes antibiotic resistance genes. The term "antibiotic resistance -genes" includes 
sequences which promote or confer resistance to antibiotics on the host organism {eg., 
Bacillus). In one embodiment, the antibiotic resistance genes architected from the 

5 group consisting of cat (chloramphenicol resistance) -genes, let (tetracycline resistance) 
genes, erm (erythromycin resistance) genes, neo (neomycin resistance). genes and spec 
(spectinomycin resistance) genes. Recombinant vectors of the present invention can 
further include homologous recombination sequences (e.g., sequences designed to allow 
recombination of the gene of interest into thechromosome of the host organism). For 

1 0 example, amyE sequences can be used as homology targets for recombination into the 
host chromosome. 

Preferred vectors of the present invention include, but are not limited to, vectors 
set forth in Figures 2-15, 17, 19, 22, 25 and 26. It will further be appreciated by one of 
skill in the art that the design of a vector can be tailored depending on such factors as the 
1 5 choice of microorganism to be genetically engineered, the level of expression of gene 
product desired and the like. 

IX. Isolated Proteins 
Another aspect of the present invention features isolated proteins (e.g., isolated 
20 pantothenate biosynthetic enzymes and/or valine-isoleucine biosynthetic -enzymes and/or 
isolated CoA biosynthetic enzymes, for example isolated CoaA or CoaX). In one 
embodiment, proteins (e.g., isolated pantothenate biosynthetic enzymes and/or valine- 
isoleucine biosynthetic enzymes and/or isolated CoaA biosynthetic enzymes, for 
example isolated CoaA or CoaX) are produced by recombinant DNA techniques and can 
25 be isolated from microorganisms of the present invention by an appropriate purification 
scheme using standard protein purification techniques. In another -embodiment, proteins 
are synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein (e.g., an isolated or purified biosynthetic 
enzyme) is substantially free of cellular material or other contaminating proteins -from 
30 the microorganism from which the protein is derived, or substantially free from 
chemical precursors or other chemicals when chemically synthesized. In one 
embodiment, an isolated or purified protein has less than about 30% (by dry weight) of 
contaminating protein or chemicals, more preferably less than about 20% of 
contaminating protein or chemicals, still more preferably less than about 10% of 
3 5 contaminating protein or chemicals, and most preferably less than about 5% 
contaminating protein or chemicals. 
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In a preferred embodiment, the protein or gene product is derived from Bacillus 
(e.g., is Bacillus-derived). The term "derived from Bacillus" or "£<ic/7/wj-derived" 
includes a protein or gene product which is encoded by a Bacillus gene. Preferably, the 
gene product is derived from a microorganism selected from the group consisting of 
5 Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, Bacillus firmus, Bacillus 

pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus circulars, Bacillus 
coagulans, Bacillus licheniformis, Bacillus megalerium, Bacillus pumilus, Bacillus 
thuringiensis, and other Group 1 Bacillus species, for example, as characterized by 16S 
rRNA type (Priest, supra). In another preferred embodiment, the protein or gene 

1 0 product is derived from Bacillus brevis or Bacillus stearothermophilus. In another 
preferred embodiment, the protein or gene product is derived from a microorganism 
selected from the group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens, 
Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus. In a particularly preferred 
embodiment, the protein or gene product is derived from Bacillus subtilis (e.g., is 

1 5 Bacillus subtilis-derived). The term "derived from Bacillus subtilis" or "Bacillus 
subtilis-derived" includes a protein or gene product which is encoded by a Bacillus 
subtilis gene. In yet another preferred embodiment, the protein or gene product is 
encoded by a Bacillus gene homologue (e.g., a gene derived from a species distinct from 
Bacillus but having significant homology to a Bacillus gene of the present invention, for 

20 example, a Bacillus pan gene or Bacillus ilv gene). 

Included within the scope of the present invention are bacterial derived proteins 
or gene products and/or Bacillus-derived proteins or gene products (e.g., B. subtilis- 
derived gene products) that are encoded by naturally-occurring bacterial and/or Bacillus 
genes (e.g.,B. subtilis genes), for example, the genes identified by the present inventors, 

25 for example, Bacillus or B. subtilis coaX genes, codA genes, pan genes and/or ilv genes. 
Further included within the scope of the present invention are bacterial-derived proteins 
or gene products and/or Bacillus-derived proteins or gene products (e.g., B. subtilis- 
derived gene products) that are encoded bacterial and/or Bacillus genes (e.g., B. subtilis 
genes) which differ from naturally-occurring bacterial and/or Bacillus genes (e.g., B. 

30 subtilis genes), for example, genes which have nucleic acids that are mutated, inserted or 
deleted, but which encode proteins substantially similar to the naturally-occurring gene 
products of the present invention. For example, it is well understood that one of skill in 
the art can mutate (e.g., substitute) nucleic acids which, due to the degeneracy of the 
genetic code, encode for an identical amino acid as that encoded by the naturally- 

35 occurring gene. Moreover, it is well understood that one of skill in the art can mutate 
(e.g., substitute) nucleic acids which encode for conservative amino acid substitutions. 
It is further well understood that one of skill in the art can substitute, add or delete 
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amino acids to a certain degree without substantially affecting the function of a gene 
product as compared with a naturally-occurring gene product, each instance of which is 
intended to be included within the scope of the present invention. 

In a preferred embodiment, an isolated protein of the present invention {e.g., an 
5 isolated pantothenate biosynthetic enzyme and/or an isolated isoleueine-valine 
bibsynthetic enzyme and/or an isolated CoaA biosynthetic enzymes, for example 
isolated CoaA or CoaX) has an amino acid sequence shown in SEQ IDNO:3, SEQ ID 
NO:9, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 
NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID NO:87. In other 
1 0 embodiments, an isolated protein of the present invention is a homologue of the at least 
one of the proteins set forth as SEQ ID NO:3, SEQ ID NO:9, SEQ ID NO:24,SEQ ID 
NO:26, SEQ ID NO:28, SEQ ID NC-.30, SEQ ID NO.32, SEQ ID NO:34, SEQ ID 
NO:36, SEQ ID NO:38 or SEQ ID NO:87<e.£., comprises an amino acid sequence at 
least about 30-40% identical, preferably about 40-50% identical, more preferably about 
1 5 50-60% identical, and even more preferably about 60-70%, 70-80%, 80-90%, 90-95% or 
more identical to the amino acid sequence of SEQ ID NO:3, SEQ ID NO:9, SEQ ID 
N0.24, SEQ ID NO:26, SEQ ID NO:28,T?EQ ID NO:30,^EQ ID NO:32,"SEQ ID 
NO:34, SEQ ID NO:36,5EQ ID NO:38 or SEQ ID NO:87, and has an activity that is 
substantially similar to that of the protein encoded by the amino acid-sequence of SEQ 
20 IDNO:3,SEQIDNO:9,SEQIDNO:24,SEQIDNO:26,SEQIDNO:28,'S£QID 
NO:30, SEQ ID NO:32, SEQ ID NO:34,^EQ ID NO:36, SEQ ID NO:38 or SEQ ID 
NO:87, respectively. 

To determine the percent homology of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimal comparison purposes <e.g., gaps can 
25 be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). When a position in the first 
sequence is occupied by the same amino acid residue or nucleotide as the corresponding 
position in the second sequence, then the molecules are identical at that position. The 
percent identity between the two sequences is a function of the number of identical 
30 positions shared by the sequences </. e. , % identity = # of identical positions/total # of 
positions x 1 00), preferably taking into account the number of gaps and size ofsaid gaps 
necessary to produce an optimal alignment. 

The comparison of sequences and determination of percent homology between 
two sequences can be accomplished using a mathematical algorithm. A preferred, non- 
3 5 limiting example of a mathematical algorithm utilized for the comparison of sequences 
is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-68, 
modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such 
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an algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) of 
Altschul et al (1990) J. Mol Biol 215:403-10. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequences homologous to nucleic acid molecules of the invention. BLAST 

5 protein searches can be performed with the XBLAST program, score - 50, wordlength = 
3 to obtain amino acid sequences homologous to protein molecules of the invention. To 
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as 
described in Altschul et al (1997) Nucleic Acids Research 25(17):3389-3402. When 
utilizing BLAST and Gapped BLAST programs, the default parameters of the respective 

10 programs (e.g., XBLAST and NBLAST) can be used. See http://www.ncbi.nlm.nih.gov. 
Another preferred, non-limiting example of a mathematical algorithm utilized for the 
comparison of sequences is the algorithm of Myers and Miller (1988) Comput Appl 
Biosci. 4:11-17. Such an algorithm is incorporated into the ALIGN program available, ♦ 
for example, at the GENESTREAM network server, IGH Montpellier, FRANCE 

1 5 (http://vega.igh.cnrs.fr) or at the ISREC server (http://www.ch.embnet.org). When 
utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

In another preferred embodiment, the percent homology between two amino acid 
sequences can be determined using the GAP program in the GCG software package 

20 (available at http://www.gcg.com), using either a Blossom 62 matrix or a PAM250 
matrix, and a gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. In yet 
another preferred embodiment, the percent homology between two nucleic acid 
sequences can be accomplished using the GAP program in the GCG software package 
(available at http://www.gcg.com), using a gap weight of 50 and a length weight of 3. 

25 

X. Biotransformations and Bidconversions 

Another aspect of the present invention includes biotransformation processes 
which feature recombinant microorganisms (e.g., mutant microorganisms) and/or 
isolated CoA, pantothenate or isoleucine-valine biosynthetic enzymes described herein. 

30 The term "biotransformation process", also referred to herein as "bioconversion 
processes", includes biological processes which result in the production {e.g., 
transformation or conversion) of any compound (e.g., intermediate or product) which is 
upstream of a CoA, pantothenate or isoleucine-valine biosynthetic enzyme to a 
compound (e.g., substrate, intermediate or product) which is downstream of said CoA, 

35 pantothenate or isoleucine-valine biosynthetic enzyme. 
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In one embodiment, the invention features a biotransformation process for the 
production of a panto-compound comprising contacting a microorganism which 
overexpresses at least one pantothenate biosynthetic enzyme with at least one 
appropriate substrate or precursor under conditions such that said panto-compound is 
5 produced and recovering said panto-compound.- In a preferred embodiment, the 

invention features a biotransformation process for the production of pantoate comprising 
contacting a microorganism which overexpresses ketopantoate reductase (the panE gene 
product) with an appropriate substrate {e.g., ketopantoate) under conditions such that 
pantoate is produced and recovering said pantoate. In another preferred embodiment, 
1 0 the invention features a biotransformation process Tor the production of pantothenate 
comprising contacting a microorganism which overexpresses ketopantoate reductase and 
pantothenate synthetase with appropriate substrates {e.g., ketopantoate and p-alanine) 
under conditions such that pantothenate is produced and recovering said pantothenate. 
In yet another preferred embodiment, the invention features a biotransformation process 
1 5 for the production of pantothenate comprising contacting a microorganism which 
overexpresses ketopantoate hydroxymethyltransferase, ketopantoate reductase and 
pantothenate synthetase with appropriate substraies<eg., a-ketoisovalerate and P- 
alanine) under conditions such that pantothenate is produced and recovering said 
pantothenate. Preferred recombinant microorganisms for carrying out the above- 
20 described biotransformations include pantothenate kinase mutants. Conditions under 
which pantoate or pantothenate are produced can include any conditions which result in 
the desired production of pantoate or pantothenate, respectively. 

In yet another embodiment, the present invention includes a method of 
producing p-alanine that includes culturing a microorganism which overexpresses 
25 aspartate-a-decarboxylase under conditions such that p-alanine is produced. Preferably, 
the aspartate-a-decarboxylase-overexpressing microorganism has a mutation in a nucleic 
acid sequence encoding a pantothenate biosyntheticenzyme selected from the group 
consisting of ketopantoate hydroxymethyltransferase, ketopantoate reductase and 
pantothenate synthetase. 
30 The invention further features a method of producing p-alanine that includes 

contacting a composition comprising aspartate with an isolated Bacillus aspartate-a- 
decarboxylase enzyme under conditions such that P-alanine is produced (e.g., an in vitro 
synthesis method). 

The microorganism(s) and/or enzymes used in the biotransformation reactions 
35 are in a form allowing them to perform their intended function (e.g., producing a desired 
compound). The microorganisms can be whole cells, or can be only those portions of 
the cells necessary to obtain the desired end result. The microorganisms -can be 
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suspended (e.g., in an appropriate solution such as buffered solutions or media), rinsed 
(e.g., rinsed free of media from culturing the microorganism), acetone-dried, 
immobilized (e.g., with polyacrylamide gel or k-carrageenan or on synthetic supports, 
for example, beads, matrices and the like), fixed, cross-linked or permeablized (e.g., 
5 have permeablized membranes and/or walls such that compounds, for example, 
substrates, intermediates or products Can more easily pass through said membrane or 
wall). 

Purified or unpurified CoA biosynthetic enzyme(s) (e.g., CoaA and/or CoaX), 
pantothenate biosynthetic enzyme(s) and/or valine-isoleucine biosynthetic enzyme(s) 

10 can also be used in biotransformation reactions. The enzyme can be in a form that 
allows it to perform its intended function (e.g. , obtaining the desired compound). For 
example, the enzyme can be in free form or immobilized. Purified or unpurified CoA 
biosynthetic enzyme(s), pantothenate biosynthetic enzyme(s) arid/or valine-isoleucine 
biosynthetic enzyme(s) can be contacted in one or more in vitro reactions with 

1 5 appropriate substrate(s) such that the desired product is produced. 

With respect to at least the above-described methodologies (e.g., the production 
methodologies of the present invention), at least one aspect of the invention features the 
folowing: embodiments is which the methods do not use microorganisms of the genus 
Corynebacterium and/or microorganisms of the genus Escherichia', embodiments in 

20 which the methods do not use microorganisms selected from the group consiting of 

Escherichia coli and Corynebacterium glutamicitm; embodiments in which the methods 
do not use gram negative microorganisms; embodiments in which the microorganisms 
utilized do not include, express or produce nucleic acid molecules, genes or proteins 
(e.g., biosynthetic emzymes) derived from microorganisms of the genus 

25 Corynebacterium and/or microorganisms of the genus Escherichia', embodiments in 
which the microorganisms to not include, express or produce nucleic acid molecules, 
genes or proteins (e.g., biosynthetic emzymes) derived from microorganisms selected 
from the group consisting of Escherichia coli and Corynebacterium glutamicum. 

30 XI. Screening Assays 

Because CoA is an essential factor in bacteria, proteins (e.g., enzymes) involved 
in the biosynthesis of CoA provide valuable tools in the search for novel anti-biotics. In 
particular, the CoaX protein is a valuable target for identifying bacteriocidal compounds 
because it bears no resemblance in primary sequence to mammalian pantothenate kinase 

35 enzymes. Accordingly, the present invention also provides a method (also referred to 
herein as a "screening assay") for identifying modulators, i.e., candidate or test 
compounds or agents (e.g., peptides, peptidomimetics, small molecules or other drugs) 
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which bind to CoaX, or have a, stimulatory or inhibitory effect on, for example, -coaX 
expression or CoaX activity. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are capable of binding to CoaX proteins or a biologically active 

5 portion thereof. In another embodiment, the invention provides assays for screening 
candidate or test compounds which modulate the activity of CoaX proteins or 
biologically active portions thereof. The test compounds of the present invention can be 
obtained using any of the numerous approaches in combinatorial library methods Icnown 
in the art, including: biological libraries; spatially addressable parallel "solid phase or 

1 0 solution phase libraries; synthetic library methods requiring deconvolution; the 'one- 
bead one-compound' library method; and synthetic library methods using affinity 
chromatography selection. The biological library approach is limited to peptide 
libraries, while the other four approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of -compounds (Lam, K.S. (1997) Anticancer Drug 

15 Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al. (1993) Proc. Nail. Acad Sci. U.S.A. 90*909; Erb et 
al (1994) Proc. Natl. Acad. Sci. USA 91:il422; Zuckermanncrfl/. (1994). J. Med. 
Chem. 37:2678; Choe/ a/. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. 
20 Int. Ed. Engl. 33 :2059; Carell et al. <1 994) Angew. Chem. Int. Ed. Engl. 33 :2061 ; and in 
Gallop etal. (1994)./. Med. Chem. 37:1233. Libraries of compounds may be presented 
in solution (e.g., Houghten (1992) Biotechniques 13:412-421), or on beads<Lam (4991) 
Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria (Ladner USP 
5,223,409), spores (Ladner USP '409), plasmids{Cull et al. (1992) Proc Natl AcadSci 
25 USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390); <Devlin 
(1990) Science 249:404-406); (Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378- 
6382); (Felici (1991) J. Mol. Biol. 222:30 1 -3 1 0); (Ladner supra.). - 

In one embodiment, an assay is a microorganism-based assay in which a 
recombinant microorganism which expresses a CoaX protein or biologically active 
30 portion thereof is contacted with a test compound and the ability of the test compound to 
modulate CoaX activity is determined. Determining the ability of the test compound to 
modulate CoaX activity can be accomplished by monitoring, for example, intracellular 
phosphopahthoate or CoA concentrations or secreted pantothenate concentrations (as 
compounds that inhibit CoaX will result in a buildup of pantothenate in the test 
35 microorganism). CoaX substrate can be labeled with a radioisotope or enzymatic label 
such that modulation of CoaX activity can be determined by detecting a conversion of 
labeled substrate to intermediate or product. For example, CoaX substrates-can be 
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labeled with 32p 5 ^C, or ^H, either directly or indirectly, and the radioisotope detected 
by direct counting of radioemmission or by scintillation counting. Determining the 
ability of a compound to modulate CoaX activity can alternatively be determined by 
detecting the induction of a reporter gene (comprising a CoA-responsive regulatory 

5 element operatively linked to a nucleic acid encoding a detectable marker, e.g., 
luciferase), or detecting a CoA-regulated cellular response. 

In yet another embodiment, a screening assay of the present invention is a cell- 
free assay in which a CoaX protein or biologically active portion thereof is contacted 
with a test compound in vitro and the ability of the test compound to bind to or modulate 

1 0 the activity of the CoaX protein or biologically active portion thereof is determined. In 
a preferred embodiment, the assay includes contacting the CoaX protein or biologically 
active portion thereof with known substrates to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to 
modulate enzymatic activity of the CoaX on its substrates. , 

1 5 . Screening assays can be accomplished in any vessel suitable for containing the 

microorganisms, proteins, and/or reactants. Examples of such vessels include microtiter 
plates, test tubes, and micro-centrifuge tubes. In more than one embodiment of the 
above assay methods of the present invention, it may be desirable to immobilize either 
CoaX protein or a recombinant microorganism expressing CoaX protein to facilitate 

20 separation of products and/or substrates, as well as to accommodate automation of the 
assay. For example, glutathione-S-transferase/CoaX fusion proteins can be adsorbed 
onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
derivatized microtiter plates. Other techniques for immobilizing proteins on matrices 
(e.g., biotin-conjugation and streptavidin immobilization or antibody conjugation) can 

25 also be used in the screening assays of the invention. 

In another embodiment, modulators of CoaX expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of 
coaX mRNA or CoaX polypeptide in the cell is determined. The level of expression in 
the presence of the candidate compound is compared to the level of expression in the 

30 absence of the candidate compound (or to a suitable control, for example, an appropriate 
buffer control or standard). The candidate compound can then be identified as a 
modulator of coaXmRNA or CoaX polypeptide expression based on this comparison. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 

35 agent identified as described herein in an appropriate animal model. For example, an 
CoaX modulating agent identified as described herein (e.g., an anti-bactericidal 
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compound) can be used in an infectious animal model to determine the -efficacy, 
toxicity, or side effects of treatment with such an agent. 

this invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents, patent applications 

5 (including U.S. Patent Application Serial No. 09/400,494, Filed September 21, 1999 
(pending), provisional U.S. Patent Application Serial No. 60/210,072, filed June 7, 
2000, provisional U.S. Patent Application Serial No. 60/221,938, filed July 28, 2000 and 
provisional U.S. Patent Application Serial No. 60/227,860, filed August 24, 2000, to 
which this application relates) and published patent applications cited throughout this 

1 0 application are incorporated herein by reference. 
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EXAMPLES 

General Methodology: 

Strains. Bacillus subtilis strains of the present invention are generally derived 
• from either of two strains. The first is variously named "168", "1 Al", or "RL-1". The 
5 genotype is trpC2. This strain was derived from the wild type "Marburg" strain by 
mutagenesis and has been the basis of much of the molecular biology work done on B. 
subtilis. The second strain is PY79, a prototrophic derivative of 168 that was made Trp* 
by transduction from the wild type strain W23. 

Media. Standard minimal medium for B. subtilis is comprised of 1 x Spizizen 
1 0 salts and 0.5% glucose. Standard solid "rich medium" is Tryptone Blood Agar Broth 
(Difco), and standard liquid "rich medium" is VY, a mixture of veal infusion broth and 
yeast extract. For testing production of pantothenate in liquid test tube cultures, an 
enriched form of VY, called "Special VY" or "SVY" is used. For batch fermentations, 
, SVYG and PFMG are used. The compositions of these media are given below. 

15 VY,a rich liquid medium: 25 g Difco Veal Infusion Broth, 5 g Difco Yeast 

Extract, 1L water (autoclave). 

TBAB, a rich solid medium: 33 g Difco Tryptone Blood Agar Broth, 1 L water 
(autoclave). 

MIN, a minimal medium: 100 ml 10 x Spizizen salts; 10 ml 50% glucose; 2 ml 
20 10% arginine HC1*; 10 ml 0.8% tryptophan**; water to 1 liter. (*In some cases, 
arginine is omitted or replace by sodium glutamate at 0.04% final concentration. In 
general, B. subtilis grows faster in minimal medium when certain amino acids, such as 
arginine, glutamine, glutamate, or proline, are added as an auxiliary nitrogen source. ) 
(**For strains that are tryptophan auxotrophs, tryptophan is routinely added to most 
25 minimal media.) 

10 x Spizizen Salts: 1 74 g K 2 HP0 4 -3H 2 0; 20 g (NH^SO.; 60 g KH 2 P0 4 ; 1 0 g 
Na 3 Citrate-2H 2 0; 2 g MgS(V7H 2 0; water to 993 mis; then add 3.5 ml F*C1 3 solution and 
3.5 ml Trace Elements solution. 

FeCl .Solution: 4 g FeCl 3 -6H 2 0; 197 g Na 3 Citrate-2H 2 0; water to 1 liter (filter 
30 sterilize) 

Trace Elements Solution: 0.1 5 g Na 2 Mo0 4 -2H 2 0; 2.5 g H 3 B0 3 ; 0.7 g 
CoCl 2 -6H 2 0; 0.25 g CuS0 4 -5H 2 0; 1 .6 g MnCl 2 -4H 2 0; 0.3 g ZnS0 4 -7H 2 0; water to 1 liter 
(filter sterilize). 
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SVY, Special VY f a supplemented* rich medium for testing pantothenate 
production in test tube cultures: 25 g Difco Veal inftcion Broth; *5 g Difco yeast 
extract; 5 g sodium glutamate; 2.7 g ammonium sulfate; 740 ml water {autoclave); add 
200 ml 1 M potassium phosphate, pH 7.0; 60 ml 50% glucose. (*For testing 
5 pantothenate production in liquid S V Y test tube cultures, Na a-k^toisovaterate and/or 0- 
alaninecan be added to a concentration of 5 g/L from filter-sterilized stocks.) 

PFMG f a yeast extract based medium used in fermentors: 20 g Amberex 

1 003™ yeast extract; 5 g sodium glutamate, 2 g ammonium sulfate; 5 g tryptophan; 10-g 

KH 2 P04; 20 g K 2 HP0 4 *3H 2 0; 1 g MgCl 2 *H 2 0; 0.1 g CaCl 2 -2H 2 0; 1 g sodium citrate; 

10 0.01 g FeS0 4 -7H 2 0; 1 ml trace elements solution; 20 g .glucose; add water to 1 L. 

Glucose or other sugars are fed as needed. Feed solutions can contain minerals, defined 

♦ 

or food grade nutrients. 

PF, a chemically defined pantothenate free medium for testing pantothenate 
auxotrophy: 100 ml 10 x Spizizen Salts; 100 ml 1 x Difco Pantothenate Assay Medium; 
15 1 0 ml 50% glucose; water to 1 liter. 

For pantothenate auxotj-qphs, 1 mM Na pantothenate is added to both minimal 
and rich media, since there is generally not enough pantothenate in rich media to support 
B. subtilis pan mutants. Amino acids are at 1 00 mg per liter, when used. 
Selection for antibiotic resistance is done with 5 mg/L chloramphenicol, 100 mg/L 
20 spectinomycin HC1, 15 mg/L tetracycline HCl, or 1 mg/L erythromycin plus 25 mg/L 
lincomycin. 

Pantothenate Assays: Biological assay. The indicator organism, Lactobacillus 
plqntarum, requires pantothenate for growth, and responds to low concentrations (ng/L). 

25 Thus, using serial dilutions, a wide range of concentrations can be assayed. 

Commercially available medium (e.g., Pantothenate Assay Medium (PAM), Difco), can 
be used. However, Difco PAM supplemented with pantothenate does not support 
growth to the same level as obtainable using a fresh-mixed version of Pantothenate 
Assay Medium (FM-PAM), made up of the individual components as specified by 

30 Difco, which is accordingly, routinely used instead of the commercial product. 

Before assaying B. subtilis culture supernatants, the B. subtilis cells must be 
either removed or killed. B. subtilis culture supernatants give approximately the same 
pantothenate titer when the supernatants are autoclaved as when they are sterile filtered. 
Accordingly, routine procedures involve autoclaving samples for 5 minutes prior to the 

35 biological assay. 



BNSDOCIO: <WO 0121772A2 I > 



WO 01/21772 



PCT/US00/25993 



-61- 

Pantothenate Assays : HPLC assay. Pantothenic acid production is measured 
by HPLC with a detector wavelength of 197 nm and a reference at 450 nm. The 
procedure is a modification of one recommended by Hewlett-Packard for water soluble 
vitamins. Samples of culture broth are diluted into an equal volume of 60% acetronitrile 

5 (ACN), centrifuged and filtered. Typically a further 10-fold dilution before analysis 
brings the final dilution to 20-fold. Higher concentrations of product are diluted further: 
Compounds are separated on a CI 8 Phenomenex 5ji Aqua 250 x 4.6 mm column with 
5% acetronitrile (ACN) in 50 mM Na phosphate buffer at pH 2.5. An ACN gradient 
from 5% to 95% washes the column between every sample. The area of the 

10 pantothenate peak is proportional to the concentration between 5 to 1000 mg/L. Other 
panto-compounds are also separated and quantitated by this method. 

Amino Acid Analysis: HPLC assay. Amino acids present in the fermentation 
medium and throughout the fermentation are measured by HPLC with a detector 
wavelength of 338 nm and a reference at 390 nm. The procedure is a modification of 

1 5 1 one recommended by Hewlett-Packard for amino acid analysis. Samples of culture 
broth are prepared identically as for the panto-compound analysis. Compounds are 
separated on a CI 8 Hypersil 5ja ODS 200 x 2.1 mm column. Solvent A is 20 mM Na 
acetate buffer at pH 7.2. Solvent B contains 40% ACN and 40% methanol. A gradient 
from 100% Solvent A to 100% Solvent B separates amino acids and washes the column 

20 between every sample. 

Batch Fermentations. Pantothenate producing strains are grown in stirred tank 
fermentors, for example, in CF3000 Chemap 14 liter vessels with 10 liter working 
volumes. Computer control and data collection is by commercial software, for example, 
B. Braun Biotech MFCS software. Fermentations can be batch processes but are 
25 preferably sugar-limited, fed batch processes. Some media components (e.g. ofSVYG 
and PMFG) are added to the fermentor and sterilized in place. Portions of the media are 
sterilized separately and added to the fermentors aseptically. This procedure is well 
known to those familiar with the art. Additional nitrogen sources in feeds are sterilized 
separately and added to the carbon source after cooling. 

30 The initial sugar in the medium is consumed in approximately 6 hours. 

Afterwards, glucose or other sugars are fed with the possible addition of minerals, and 
defined or food grade nutrients. Alternatively, feeds are scheduled based on a consensus 
profile of nutritional requirements from samples taken from earlier fermentations. 
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After inoculation, agitation is set at a relatively low speed, e.g. 200 rpm. When 
the dissolved oxygen (p02) falls to 30%, computer -control automatically adjusts the 
agitation to maintain a dissolved oxygen -concentration between 25 and 30% p02. 

5 EXAMPLE I: Enhanced Production of a Panto-Compound Using Bacteria 
Overexpressing panBCD Gene Products. 

This Example describes the cloning of the B. subtUis panBCD operon and the 
generation of microorganisms overexpressing thepa/iflCD^ene products. 

To clone the B. subtilis panBCD operon, a plasmid library of B. subtilis GP275 
1 0 (a derivative of 1 68) genomic DNA was transformed in E. coli BM4062 \birA*), and 
temperature resistant clones were selected at 42°C. By comparison of restriction maps 
to the genome sequence, one particular clone was deduced to contain the B. subtilis birA 
gene and the adjacent panBCD^enes. This plasmid was named pAN201 . 

To overexpress the panBCD operon and produce pantothenate, the native 
1 5 promoter of the panBCD operon was replaced by either of two strong, constitutive 
promoters derived from the B. subtilis bacteriophage ?P0 1 . These two promoters are 
named P 26 and P, ,. In addition^ -either of two artificial ribosome binding sites^RBSs) 
were used to replace the native panB RBS. These two artificial RBSs^set forth as-SEQ 
ID NO:49 and SEQ ID NO:50) were predicted to increase translation of panBCD, their 
20 sequences are shown in Table 1 A Three such engineered panBCD expression cassettes 
were built into circular plasmids capable of replicating in E. coli. Other features of the 
piasmids include a strong rho-independent transcription terminator from the E. coli 
ribosomal RNA transcription unit, called T,T 2 , a Gram-positive chloramphenicol 
resistance gene (cat), derived from pC194, and a pair of Notl restriction sites at the 
25 junctions between the E. coli replicon and the segment intended for integration into S. 
subtilis. Three plasmids of this series, pAN004, pAN005, and pAN006 were 
constructed. pAN004 contains the P 26 promoter, RBS1, and a low copy £. ro/i replicon. 
pAN005 contains the Pis promoter, which in our experience is not as strong as P 26 , 
RBS1 , and the low copy replicon. pAN006 contains the P 26 promoter, HBS2, and a 
30 medium copy replicon. 

The three panBCD expression cassettes contained in the above-mentioned three 
plasmids were all ligated to a DNA fragment consisting of sequences that naturally 
occur immediately upstream from the native panB gene and integrated in single copy by 
homologous recombination into the panBCD locus of B. subtilis strains RL-1 and PY79, 
35 replacing the wild-type operon. This was accomplished in two steps. First a delelion- 
substitution that replaced about two thirds of the panB coding region with a Gram- 
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positive spectinomycin resistance gene (spec) was integrated at panB to yield Spec", 
pantothenate auxotrophs. These intermediate strains were, than transformed with the 
panBCD expression cassettes of pAN004, pAN005, and pAN006 after ligating them to a 
DNA fragment containing chromosomal sequences just upstream of panB. Selection of 
5 the incoming cassette was for pantothenate prototrophy. The resulting strains were 
named PA221, PA222 and PA223 {from RL-1), and PA235, PA232 and PA233 (from 
PY79), respectively. An example of a plasmid that contains the joined upstream 
sequence that is in the integrated strain in PA221 is pAN240 (see Figure 2). The 
nucleotide sequence of pAN240 is set forth as SEQ ID NO:76. 

1 0 Polymerase chain reaction using appropriate primers was used to verify the ' 

correct chromosomal structures of these engineered strains. When extracts of strain 
PA221 were examined by SDS-PAGE, two proteins were found to be overexpressed. 
One protein had an apparent molecular weight of 29,000 and the other protein appeared 
to be 39,000 daltons. The 29,000 dalton bands is presumably PanB {predicted molecular 

1 5 weight of 29,761). The larger protein band presumably represents PanC (predicted size 
31,960 daltons). 

The ability of these strains to produce pantothenate in test tube cultures was 
assesed as follows. Each strain was grown in SVY medium supplemented with 5 g/L a- 
ketoisovalerate (a-KIV) and 5 g/L p-alanine, to ensure that these precursors were not 
20 limiting. Culture supernatants were autoclaved and assayed using the bioassay. 

Relative to the parent strains, RL-1 and PY79, the engineered strains produced about 8- 
to 30-fold more pantothenate, attaining 1 g/L pantothenate in some cases. 
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Table 2. Production of pantothenate byengineeredB.subtilis strains in liquid test 
tube cultures grown in SVY medium with 5,g/L a-KlV and 5 g/L fi- 
alanine. 

. RfiS at {pantothenate] 
Expt. Strain Promoter panh mg/L 



RL-1 


Native 


IN all vc 


30 


PA221 


P« 


RBS1 


990 








790 


PA222 


P» 


RBS1 


250 








250 


PA223 


P« 


RBS2 


790 








790 


PY79 


Native 


Native 


40 


PA235 


P« 


RBS1 


930' 




860 


PA221 


P,. 


RBS1 


1100 




1030 



5 The P 26 promoter was about 3- to 4-fold more effective than the P„ promoter, 

while RBS1 and RBS2 were roughly equivalent. Plasmids such as pAN004 5 pAN005, 
P AN006 can also be recombined as circles into the B. subtilis wild type panBCD locus 
by Campbell-type (single crossover) integration, selecting for chloramphenicol 
resistance at 5 mg/L. Strains obtained in this fashion produce about the same amount of 

10 pantothenate as strains PA221, PA222, and PA223, respectively. pAN004 containing 
the P 26 promoter, RBS1 and a low copy E. coli replicon, is depicted schematically in 
Figure 3A. The nucleotide sequence of plasmid pA*J004 is set forth as^EQ ID NO:93. 
P AN006 containing the P 26 promoter, RBS2 and a medium copy E. coli replicon, is 
depicted schematically in Figure 3B. The nucleotide -sequence of plasmid pAN006 is set 

1 5 forth as SEQ ID NO:94. The nucleotide sequence of panBCD is set forth as SEQ ID 
NO:59 and the predicted amino acid sequences of PanB, PanC and PanD are set forth as 
SEQ ID NO:24, SEQ ID NO:26 and SEQ ID NO:28, respectively. Methods for 
manipulating Bacilli are described, for example, in Harwood, C.R. and Cutting, S.M. 
(editors), Molecular Biological Methods for Bacillus (1990) John Wiley & Sons, Ltd., 

20 Chichester, England, the content of which is incorporated herein by reference. 
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EXAMPLE II: Enhanced Production of a Panto-Compound Using Bacteria 
Overexpressing the panEl Gene Product - Ketopantoate Reductase. 

This Example describes the cloning of the B. subtilis panEl gene and the 
generation of microorganisms overexpressing the panEl gene product. 

5 Pan" B. subtilis strains (e.g. , B. subtilis mutants blocked in the synthesis of 

pantothenic acid) had previously been isolated, one of which was reported to be affected 
in ketopantoate reductase activity (Baigori et al. (1991) J. Bacteriol 173:4240-4242). 
However, the mutations in these strains were incorrectly mapped to the purE-lre interval 
of the B. subtilis genetic map which does not contain the panE or panBCD genes. 

1 0 Furthermore as shown below, a panE mutant does not have a Pan" phenotype as the ilvC 
gene product can substitute for the panE gene product in B. subtilis as in other bacterial 
strains such as E. coli. More recently, the S. typhimuruim panE gene has been located 
and determined to be allelic to apbA, a gene required for anaerobic purine biosynthesis 
(Frodymae/a/. (1998)7. Biol Chem. 273:5572-5576). E. coli carries a highly 

15 homologous gene at the same map location. Identification of the panE genes in E. coli 
and S. typhimurium was complicated by the fact that the ilvC gene product, 
acetohydroxy acid isomeroreductase, is also capable of carrying out the ketopantoate 
reductase reaction. As a result, pantothenate auxotrophy is not obtained unless both 
panE and ilvC are mutated. 

20 To identify the B. subtilis panEl gene, the B. subtilis genome was searched using 

the protein sequence of E. coli or S. typhimurium ApbA (PanE), and two open reading 
frames were identified having homology to ApbA, named ylbQ BndykpB. These genes 
were renamed panEl and panE2, due to their proposed function in pantothenate 
biosynthesis. Both panEl and panE2 were cloned as PCR products generated from 

25 RL-1 genomic DNA as a template. Both genes were disrupted by either a 

spectinomycin resistance gene (spec) or a chloramphenicol resistance gene (cat). The 
interrupted genes were each integrated by double crossover into PY79 to give PA240 
(ApanEl::spec) and PA241 (ApanE2: :cal). Neither of these strains were pantothenate 
auxotrophs when tested on pantothenate-free (PF) plates, although PA240 containing 

30 ApanElr.spec grew slightly more slowly on TBAB without added pantothenate than 
with a 1 mM pantothenate supplement. By comparison, a bpanBwspec strain does not 
produce single colonies on TBAB, presumably because B. subtilis has no active uptake 
system for pantothenate. 

It was hypothesized that the B. subtilis gene, i/vC, could function for panE as had 

35 been shown for E. coli. Accordingly, the panEl and panE2 disruptions were introduced 
into a strain, CU550, which is reported to be trpC2 ilvC4 leuC124. Both the single 
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panEl and the double pan£I i pan£2 <\isruptants were pantothenate auxotrophson PF 
medium. 

"Table 3. Phenotypes of various pan El and panE2 mutants on rich and defined 
5 media. 



Growth*: 



Strain 


Medium - pan 


+ pan 


PY79 


TBAB +++ 
PF ++ 


+++ 
++ 


PA240 


TBAB spec + 
PF ++ 


+++ 
++ 


PA241 


TBAB cam +++ 
PF ++ 


+++ 
++ 


CU550 

i 


TBAB +++ 

PF -H- 


+++ 

'++ 


PA256 


TB AB spec 
PF 


+++ 
++ 


PA258 


TBAB spec, cam 
PF 


+++ 

+4- 



♦Each represents about 1 mm of colony diameter after overnight at 37°C 

Thus, mutating both panEl and ilvC results in pantothenate auxotrophy, while 
mutating only panEl does not, similar to what has been reported Tor E.coli and 5. 
10 typhimurium. 

Next, the quantitative effect of panEl and panE2 knockouts in a pantothenate 
overproducing strain (PA235 described herein) was examined. The panEl and panE2 
disruptions were introduced into PA235, either singly or together to produce PA245 
(ApanEl ::spec), PA248 (Apa*£2::cat) and PA244 (ApanEl ::cat, Apa/i£2::spec). The 
1 5 effect of each mutation on pantothenate production was then tested in liquid test tube 
cultures. 
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Table 4. Pantothenate production by PA235 derivatives containing panEl qnd 



panE2 disruptions. 


Strain 


[pan] mg/L 


%ofPA235 


rA233 


990 


. (100) 


PA235 


940 


95 


PA245 


59 


6 


PA245 


82 


8 


PA248 


1060 


106 


PA248 


1030 


104 


PA244 


25 


3 


PA244 


50 


5 



Thus, deletion analysis indicated that the panEl gene contributes to over 90% of 
5 the pantothenate production, while deletion of panE2 did not have a significant effect on 
pantothenate production. It is therefore concluded that panEl accounts for most, but not 
necessarily all, of the ketopantoate reductase activity in B. subtilis. The rest of the 
ketopantoate reductase activity is predicted to be supplied by i/vC. 

Having identified panEl as an important gene for pantothenate production, 

1 0 increased panEl expression was tested to determine whether it could enhance 
pantothenate production in strains such as PA221 or PA235. The panEl coding 
sequence was installed downstream of the P 2 e promoter and RBS2 in a vector, pOTP61, 
designed to integrate and amplify at either the bpr locus <a non-essential protease gene) 
or at the locus of the cloned insert. The resulting plasmid, pAN236 (Figure 4) was 

1 5 transformed into PA22 1 , selecting for resistance to tetracycline at 1 5 mg/L. The 
nucleotide sequence of pAN236 is set forth as SEQ ID NO:77. One transformant, 
named PA236 was chosen for further study. 

PA236 was shown to overexpress a protein of about 31,000 daltons, which is 
close to the expected molecular weight of 33,290 daltons for panEl protein. Briefly, 

20 whole cell extracts were prepared from P Y79, RL- 1 , P A22 1 , PA22 1 /pOTP6 1 and P A236 
(2 samples). Cell extracts were separated by gel electrophoresis and the gels were 
coomassie stained to visualize proteins. In cells engineered to overexpress panE 
(PA236-1 and PA236-2), a band was visible having an approximate molecular weight of 
-31,000 daltons (as compared to molecular weight markers). Moreover, PA221 and 

25 PA236 expressed increased levels of a -29,000 dalton band, corresponding to the panB 
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gene product, and a -39,000 dalton band, presumably corresponding the pawC gene 
product. Furthermore, E. coli transformed with pAN006 (Figure 3B) expressed bands 
correlating to the panB and panCgent products and E. coli transfected with PAN236 
expressed a -31,000 dalton band corresponding to the panE gene product. 

5 Next, PA236 was compared to P A22 1 carrying the empty vector pOTP6 1 for 

pantothenate production in liquid test tube cultures supplemented with 5 g/L p-alanine 
and 5 g/L a-KIV. 

Table 5. Effect ofoverexpression of panEl and panE2 on pantothenate 
10 production by engineered strains in liquid test tube cultures. 



Strain 


Additional 


Gene 


[Pantothenate] 




Plasmid 


Overexpressed 


mg/L i 


PA221 


pOTP61 


none 


1,000 








940 


PA236 


pAN236 


panEl 


2,030 








2,050 


PA238 


pAN238 


panE2 


'530 








680 



Overexpression of panEl caused a twofold increase in pantothenate production 
when compared to the parent strain (e.g. , to slightly over 2 g/L) whereas overexpression 
of panE2 resulted in a strain that produced about 35% less pantothenate than the parent 
1 5 strain. The panEl nucleotide sequence and predicted amino acid sequence are set forth 
as SEQ ID NO:29 and SEQ ID NO:30. 

EXAMPLE III: Enhanced Production of a Panto-Compound by Culturittg 
Bacteria Overexpressing panEl or panBCD in the Presence of Valine. 

20 The ability of valine to function as a media supplementing., as a substitute for 

a-KIV) in strains engineered to overexpress the panBCD operon and panEl was 
evaluated. Valine is closely related to a-KIV by transamination, is less expensive than 
a-KIV, and is commercially available in kilogram quantities. Valine was substituted for 
a-KIV in the standard liquid test tube cultures in SV Y medium. The concentration of 

25 valine was varied from 5 to 50 g/L. Although valine at 5 g/L was slightly less effective 
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than a-KIV in promoting pantothenate production, valine at 10 or 20 g/L equaled or 
surpassed 5 g/L a-KIV in promoting pantothenate production. 

EXAMPLES IV-X Generation of Microorganisms Capable of Producing 
5 Pantothenate in a Precursor-Independent Manner 

B. subtilis strains such as PA221 and PA235 (engineered to overexpress 
panBCD) and PA236 (engineered to overexpress panBCD and panEl) need to be fed a- 
ketoisovalerate (a-KIV) (or valine) and aspartate (or 0-alanine) to achieve maximal 
pantothenate production, as both these precursors are limiting for pantothenate 
10 synthesis. Accordingly, manipulated microorganisms were designed to eliminate the 
need to feed limiting precursors of pantothenate biosynthesis in the production of 
pantothenate. These strains are also useful in the production of various pantothenate 
biosynthetic pathway intermediates. 

1 5 EXAMPLE IV: Generation of Microorganisms Capable of Producing 
Pantothenate in an Aspartate- (or p-Alanine) Independent Manner 

The panD gene was cloned into B. subtilis expression vector pOTP61 to 
construct pAN423 (Figure 5). The nucleotide sequence of pAN423 is set forth as SEQ 
ID NO:78. The Not\ restriction fragment containing panD was isolated from pAN423, 
20 self ligated and used to transform PA22 1 . Transformants resistant to Tet 1 5 , Tet 30 , and 
Tet 60 were isolated and saved for further analysis. 

Six of the pAN423 transformants plus two control transformants were grown in 
SVY containing 5 g/1 a-KIV with and without 10 g/1 aspartate and then assayed for 
pantothenate production (Table 6). 

25 

Table 6. Effect of overproducing PanD on pantothenate production with and without 
added aspartate. 



Culture* 
(PA221 transformants) 


Asp 
(10 g/L) 


TetR** 
(jig/ml) 


ODsso 


[pan] 
(mg/L) 












pOTP61-1 




60 


8.0 


76 


pOTP61-2 




60 


7.7 


91 


423#1-1 




15 


8.5 


180 


423#l-2 




15 


8.0 


150 


423#l-3 




30 


8.3 


220 


423#l-4 




30 


8.5 


280 


423#l-5 




60 


8.9 


580 


423#l-6 




60 


8.8 


280 
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pOTP61-l 




60 


/.J 






+ 


60 


6.9 


560 


423#1-1 


+ 


15 


8.5 


1200 


423#l-2 


+ 


15 j 


8.6] 


1000 


423#l-3 


+ 


30 ; 


8.8] 


1200 


423#l-4 


+ 


30 


9.0 ; 


, 1200 


423#l-5 


+ 


60 : 


9.0, 


1200 


423#l-6 


+ 


60 


9.0 


1200 



♦Test tubes cultures were grown in SVY + a-KIV <5«/L) with Asp (lOg/L) where 



indicated. 

* *TetR = Approximate Tet-resistance of transformant 

The pAN423 transformants produced at least twice the amount of pantothenate 
5 as the controls {i.e., to a level at or near that which was obtained in -earlier experiments 
by the addition of P-alanine to the culture medium). The data also show that in the 
absence of added aspartate, transformants containing additional copies of ihz panD gene 
expression cassette produce more pantothenate than the control transformants. One of 
the transformants, 423#l-5, produced about five times as much pantothenate as the 
1 0 controls. These results indicated that increased levels of PanD protein "pull" the 
conversion of available aspartate towards p-alanine, and that increasing panD gene 
expression can result in enhancement of pantothenate production both in the presence 
and absence of added aspartate. 

Transformant 423#1 -5 was re-named strain PA401 and studied further in shake 
1 5 flask fermentations. The shake flask medium was SVY with maltose instead of SVY 
with glucose. Results of shake flask experiments agreed well with test tube experiments 
during the first 24 hours. In shake flask experiments without the addition of p-alanine, 
PA401 produced approximately 1 .5 g/1 of pantothenate in 24 hours. Addition of P- 
alanine to the culture medium did not further improve pantothenate titers (Table 7), 
20 indicating that with this -strain and these fermentation conditions, P-alanine is not 

limiting pantothenate production. In fact, when no p-alanine is fed, one can observe that 
PA401 is secreting P-alanine in significant amounts into the medium. 
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Table 7. Shake flask cultures with strain PA401 (panD) with and without fi-alan\ne. 





Amino acids (g/l) 


24 hours 


Initial 

P-ala 
Added 


P-ala 


Val 


PH 


OD600 


Pantothenate 
(g/D 


0 


0.7 


1.5 


7.5 


13.7 


1.5 


5 g/l 


7.1 


1.4 


7.6 


12.4 


1.5 



Each value represents the average of duplicate 250 ml baffled flasks containing 50 ml of 
5 medium, incubated at 37°C with shaking (200 rpm). 
Base Medium: SVY with 10 g/l ot-KIV, 30 g/l maltose 
2% Inoculum: SVY with Tet 15 grown 24 hours. » 

EXAMPLE V: Engineering the panD gene for Further Increased Synthesis of 
1 0 Aspartate Decarboxylase and Enhanced Production of Pantothenate 

This Example describes the generation of improved ribosome binding sites 
(RBSs) in the panD gene to increase the translation of panD mRNA. 

Increasing the translation of the panD gene mRNA by generation of synthetic panD 
RBSs 

15 The RBS (SEQ ID NO:88) used to express panD in pAN423 is a synthetic RBS 

and has been used to successfully produce other proteins in B. subtilis at a high level. 
However, it contains six mismatches when aligned to the "ideal" B. subtilis RBS (SEQ 
ID NO:45) (e.g., an RBS having a sequence which is complementary to the 16S RNA 
sequence within the B.subtilis ribosome). (See e.g., Table IB, mismatches in bold). 

20 Two new RBSs were designed to more closely mimic the ideal RBS. These synthetic 
RBSs, named new design A (NDA) and new design B (NDB) (also referred to herein as 
RBS3 and RBS4), are set forth as SEQ ID NO:5 1 and SEQ ID NO:52 and are aligned 
with the ideal RBS in Table 1 B. 

Oligonucleotides corresponding to the top and bottom strands of each new RBS 
25 were synthesized, annealed, then used to replace the RBS in pAN420, generating 
plasmids pAN426 and pAN427. These constructions are illustrated in Figure 6. The 
presence of the NDA and NDB RBS in pAN426 and pAN427 was confirmed by DNA 
sequence analysis. Next, the panD genes from pAN426 and pAN427 were transferred 
to B. subtilis expression vector pOTP61 as shown in Figure 7, creating pAN428 and 
30 pAN429. The nucleotide sequence of pAN429 is set forth as SEQ ID NO:79. 
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Notl restriction fragments lacking the £ coli vector sequences were isolated 
from pAN428 and pAN429, self-ligated, and used to transform strain PA221 to 
resistance to Tet ,s . Four isolates resistant to Tet 60 were picked from each transformation 
and assayed for pantothenate and p-alanine production along with PA221 transformed 
with the empty vector (pOTP61) and PA221 transformed with pAN423 (strain PA401) 
(see Table 8). 



10 



Table 8. 



Plasmid 



Pantholhenate production by test tube cultures ofPA221 transformed 
with pAN428 and pAN429 



P OTP61 
' pAN423 

pAN428-l * 
pAN428-2 
P AN428-3 
pAN428-4 

pAN429-l 
pAN429-2 
pAN429-3 
pAN429-4 # 



Medium 
Supplements 



a-KlV J 
a-KIV 5 

a-KIV 5 
a-KIV 5 
a-KIV 5 
a-KIV 5 

a-KIV 5 
a-KIV 
a-KIV 5 
a-KIV 5 



5 



OD550 



10 
10 

12 
11 
11 
10 

12 
11 
11 
12 



Pan 

g/l 



UND 
0.4 

0.6 
0.5 
0.3 
0.1 

0.6 
0.5 
0.6 
0.8 



(3-Ala 
g/l 



0.04 
0.04 

0.04 
0.03 
0:03 
UND 

0.04 
0.04 
0.05 
0.10 



pOTP61 
pAN423 

pAN428-l * 
pAN428-2 
pAN428-3 
pAN428-4 

pAN429-l 
pAN429-2 
pAN429-3 
pAN429-4 



a-KIV 5 + Asp] 0 
a-KIV 5 + Asp 10 

a-KIV 5 + Asp 1 ]* 
a-KlV 5 + Asp'° 
a-KIV 5 + Asp ° 
a-KIV 5 + Asp 10 

a-KIV 5 + Asp 1 0 

a-KIV 5 + Asp 1 " 
5 10 
a-KlV^ + Asp " 

a-KIV 5 + Asp 



11 

12 

12 
12 
12 
11 

12 
12 
12 
13 



0.5 
0.9 

0.8 
0.8 
0.9 
0.8 

0.8 
0.8 
0.8 
0.8 



0.08 
1.32 

1.97 
1.51 
1.02 
0.30 

1.78 
1.66 
1.78 
2.28 



UND: Below the limits of detection. 



Renamed PA402 # Renamed PA403 



When grown in medium supplemented with a-KIV at 5 g/l (a-KJV 5 ), the 
pAN428-l transformant and all four of the pAN429 transformants produced more 
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pantothenate than did PA401, suggesting that these transformants contain higher .levels 
of aspartate decarboxylase activity. When grown in medium supplemented with a-KIV s 
and Asp 10 none of the pAN428 or pAN429 transformants produced more pantothenate 
than PA40 1 . However, the pAN428-l transformant and all four of the pAN429 
transformants produced significantly more ^-alanine than did PA401 . It is possible that 
the excess p-alanine produced from added aspartate causes inhibition of pantothenate 
production. Alternatively, p-alanine may accumulate because pantoate is limiting in 
these strains. 

The strains that produced the highest level of P-alanine, the pAN428-l and 
pAN429-4 transformants, were renamed PA402 and PA403, respectively. These two 
strains were grown in SVY medium supplemented with various intermediates and 
reassayed for pantothenate and ^-alanine production. PA221 and PA401 were included 
as controls. The results of the assays are presented in Table 9. 



15 Table 9. 



Pantothenate production ofPA402 andPA403 in test tube cultures. 



Strain 



Medium 
Supplements 



ODsso 



Pan 

en 



P-Ala 

g/1 



Val 

g/1 



PA221 
PA401 
PA402 
PA403 



Ot-KIV"' 
a-KlV 5 
a-KIV 5 
a-KlV 5 



7.9 
8.7 
8.5 
9.4 



UND 
0.3 
0.5 
0.7 



UND 
0.04 
0.04 
0.07 



0.9 
0.9 
0.9 
0.9 



PA221 
PA401 
PA402 
PA403 



a-KIV 5 + Asp 10 

™/5 . A 0 
a-KIV +Asp 

«, |\/5 , a 10 

a-KIV +Asp 
a-KIV +Asp 



9.8 
9.1 
9.4 
9.7 



0.4 
0.8 
0.8 
0.7 



0.11 
1.15 
2.02 
2.40 



0.8 
0.8 
0.8 
0.8 



PA221 
PA401 
PA402 
PA403 



PA221 
PA401 
PA402 
PA403 



Pantoate 
Pantoate 
Pantoate 
Pantoate 



5 



5 



8.9 
8.7 
10.6 
10.5 



UND 
0.3 
0.5 
0.7 



Pantoate 5 + Asp 10 
Pantoate 5 + Asp 10 
Pantoate^ + Asp ( p 
Pantoate +Asp 



9.5 
9.2 
9.1 
10.2 



0.4 
2.2 
2.8 
2.9 



UND 
0.02 
0.02 
0.02 



0.06 
0.62 
1.17 
1.58 



0.2 
0.2 
0.2 
0.2 



0.2 
0.2 
0.2 
0.2 
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UND: Below the limits of detection. 

When grown in medium supplemented with either a-KIV 5 or Pantoate 5 , PA402 

and PA403 produced significantly more pantothenate than did PA401. As before, even 

though PA402 and PA403 produced significantly more P-alanine than PA401 when 
5 grown in medium supplemented with a-KIV 5 and Asp 10 , they did not produce a 

proportional increase in pantothenate. However, whengrown in medium "supplemented 

with Pantoate 5 plus Asp 10 , both PA402 and PA403 produced significantly more 

pantothenate than PA40 1 , about a 30% increase. 

It can be concluded from these experiments that the improved NDA andNDB 
1 0 panD ribosome binding sites, engineered into pAN428 and pAN429, respectively, lead 

to increased levels of aspartate decarboxylase activity. 

Increasing the translation of the panD gene mRNA by generat ion of synthetic panD 
RBSs within thepanBCD operon 

15 

The native B. subtilis panD gene ribosome binding site (RBS) (SEQ ID NO:43), 
which is found in the P 2 6panBCD operon cassette present in PA221 (and in other 
engineered pantothenate production strains described herein), is shown in Table 1C 
aligned with the ideal ribosome binding site (SEQ ID NO:47). The alignment shows 
20 mismatches between the native B. subtilis panD gene RBS, which is located within the 
coding sequence for PanC, and the the ideal RBS. Three new RBSs (within the P26 
panBCD operon cassette) were generated to increase translation of the panD gene 
mRNA and to yield increased synthesis of aspartate decarboxylase. These synthetic 
RBSs (termed NDI, NDII, and NDIII, also referred to herein as RBS5, RBS6 and RBS7, 
25 respectively) are set forth as SEQ ID NOf55, SEQ ID NO:56 and SEQ ID NO:57, 

respectively) and are included in Table 1C. It should be noted that although changes in 
the panD RBS within the panBCD operon also changes the C-terminal amino acid 
sequence of the PanC protein encoded by that operon, an alignment of known and 
suspected PanC protein amino acid sequences showed that the sequence of the last nine 
30 amino acids of the B. subtilis PanC protein could be altered without affecting any 
conserved amino acid residues indicating that such changes should not reduce 
pantothenate synthetase activity or expression. The new RBSs were synthesized and 
incorporated into the P 26 panBCD operon expression cassette as follows. 

First, PCR primers were designed to contain the following elements: (1) a 
35 nucleic acid sequence encoding the first five amino acids of PanD up to and including a 
. unique Bsi WI restriction site that had been previously introduced into panD by PCR; (2) 
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a stop codon for panC, (3) at least one synthetic RBS; and (4) 30-39 bp of nucleic acid 
sequence having 1 00% identity with panC upstream of the panD RBS. The primers 
were named TP102, TP103, and TP104 and contain the NDI, NDII, and NDIII ribosome 
binding sites, respectively. These three primers were used in conjunction with the 5' 
5 1 primer TP1 01 , which hybridizes near the start codon of panC, in three independent PCR , 
reactions to generate the NDI, NDII, and NDIII PCR products. The PCR products were 
purified, digested with Xbal, then cloned into plasmid vector pASK-lBA3 which had 
been digested with Xbal and Smal. The resulting plasmids were named pAN43 1 , 
pAN432, and pAN433. The construction of pAN431 is illustrated in Figure 8 and is 
1 0 representative of all three plasmid constructions. The presence of the desired synthetic 
panD gene RBS in each new plasmid was confirmed by DNA sequencing. 

Next, the modified panC genes containing the new panD RBSs were joined with 
the panD gene utilizing the unique BsiWl restriction site. This was accomplished by 
isolating the appropriate Nsil-BsiWl restriction fragments frojn pAN431, pAN432, and 
1 5 . pAN433 and ligating them with a 2395 bp Nsil-BsWl restriction fragment from 

pAN420, which supplied the Zfc/WI-modified panD gene. These constructions resulted 
in plasmids pAN441, pAN442, and pAN443, respectively. A representative 
construction (pAN441) is illustrated in Figure 9. The nucleotide sequence of pAN443 is 
set forth as SEQ ID NO:80. 

20 The new panD gene RBSs were then substituted into the P 2 eponBCD operon 

expression cassette as follows. First, a deletion-insertion mutation which removes the 
region of panC containing the panD RBS was created. This was constructed by 
digesting pAN430 with a mixture of BspEl and Bglll and recovering the 4235 bp 
fragment which is now missing the 3' end of panC and the 5' end of panD. This 

25 fragment was ligated with an Aval-BaniHl restriction fragment from plasmid pECC4, 
which contains the chloramphenicol acetyl transferase (cat) gene. The 5' extension 
produced by Aval digestion is compatible with that produced by BspEl while the Bglll 
and BamUl extensions are also compatible. The resulting plasmid was named pAN440, 
and its construction is illustrated in Figure 10. 

30 The resulting deletion-insertion mutation was crossed into the P26 panBCD 

operon via homologous recombination by transforming PA221 with linearized pAN440 
and selecting for resistance to chloramphenicol on Cam 5 plates containing 1 mM 
pantothenate. Several transformants were tested, and were all found to require 1 mM 
pantothenate for growth, as expected. Two of these transformants were remaned 

35 PA408A and PA408B and were assayed for pantothenate production. Neither strain 
synthesized measurable quantities of pantothenate, even when grown in medium 
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containing pantoate and P-alanine at 5 g/1, indicating that the strains are deficient in 
pantothenate synthetase activity. Next, the new panD RBSs were crossed into the P 2 6 
panBCD operon by transforming PA408 with linearized pAN44 1 , pAN442, and 
pAN443 plasmid DNA and selecting for growth on TBAB plates without pantothenate 
5 supplementation. A transformation with linearized pAN430 (including the native panD 
RBS) was included as a control and was expected to give rise to transformants identical 
to PA221 described herein. Four isolates from each transformation were assayed for 
pantothenate and P-alanine production in SVY medium supplemented with various 
intermediates (Tables 10 and 1 1). 

10 

Table JO. Pantothenate production of PA410 - PA4J3 in test tube cultures. 



strain 


one 

IvDO 


tVTpHiiim 

I'ICU 1 tl 111 




Pan 


R-AIn 






Supplements 




g/l 




PA221 


native 


raniOaie 


1 1 

1 1 


UND 


UND 


PA410-1 


native 


■c 

Pantoate $ 


12 


UND 


UND 


PA4I0-2 




Pantoate^ 


12 


UND 


UND 


PA410-3 




Pantoate. 


12 


UND 


UND 


PA410-4 




Pantoate 


12 


UND 


UND 


PA4I1-1 


NDI 


Pantoate^ 


12 


0.23 


UND 


PA411-2 




Pantoate^ 


12 


0.20 


UND 


PA4U-3 




Pantoate^ 


12 


0.19 


UND 


PA411-4 




Pantoate 


12 


UND 


UND 


PA412-1 


NDH 


Pantoate^ . 


12 


UND 


UND 


PA412-2 




! Pantoate^ 


11 


UND 


UND 


PA412-3 




Pantoate^ 


13 


0.18 


UND 


PA412-4 




Pantoate 


12 


0.18 


UND 


PA413-1 


NDIII 


Pantoate^ 


12 


0.18 


UND 


PA413-2 




Pantoate 5 


12 


0.17 


UND 


PA413-3 




Pantoate^ 


12 


0.16 


UND 


PA413-4 




Pantoate 


12 


0.17 


UND 



UND: Below the limits of detection. 
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Table 1 1. Pantothenate production of PA410 -PA413 in test tube cultures. 



Strain 


RBS 


Medium 
Supplements 


ODSSO 


Pan 

8/1 


P-AIa 

g/1 


PA22I 


native 


Pantoate 5 + Asp 10 


11 


0.3 


UND 

l 


PA410-1 
PA4I0-2 
PA410-3 
PA410-4 


native 


Pantoate 5 + Asp 1 jj 
Pantoate^ + Asp jjj 
Pantoate- + Asp. A 
Pantoate + Asp 


12 
12 
12 
12 


0.4 
0.4 
0.4 
0.4 


UND 
UND 
UND 
UND 


PA411-1 
PA411-2 
PA411-3 
PA41I-4 


NDI 


Pantoate 5 + Asp]j? 
Pantoate^ + Asp J J? 
Pantoate^ + Asp ^ 
Pantoate + Asp 


13 
13 
13 
13 


1.7 
1.7 
1.8 

0.4 1 


0.4 
0.4 
0.3 
UND 


PA4I2-1 

HA A 10 0 

PA412-3 
PA412-4 


NDII 


Pantoate 5 + Asp! J? 
Pantoate, + Asp . . 
Pantoate^ + Asp " 
Pantoate + Asp 


13 
12 
12 
12 


0.4 

A A 

0.4 
1.6 
1.5 


UND 
UND 
0.3 
0.2 


PA413-1 
PA413-2 
PA413-3 
PA413-4 


ND1II 


Pantoate* + Asp}!! 
Pantoate^ + Asp " 
Pamoate^ + Asp J" 
Pantoate +Asp 


13 
13 
13 
13 


1.6 
1.6 
1.7 
1.7 


0.3 
0.4 
0.4 
0.4 



UND: Below the limits of detection. 



5 As expected from previous experiments using PA221, none of the transformants 

that contained the native panD RBS produced measurable quantities of pantothenate 
when grown in medium supplemented with pantoate. However, nine of the twelve 
transformants expected to contain modified panD RBSs produced significant quantities 
of pantothenate (160-230 mg/1) under these conditions, indicating that they possess 

1 0 elevated levels of aspartate decarboxylase activity. When grown in medium 

supplemented with both pantoate and aspartate, these same nine transformants produced 
approximately four times more pantothenate than those with the native panD RBS. In 
addition, these nine transformants accumulated measurable quantities of P-alanine (230- 
410 mg/1). All transformants produced roughly equivalent quantities of pantothenate 

1 5 when grown in medium containing pantoate and p-alanine, demonstrating that each 
contains a functional pantothenate synthetase. 
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These data demonstrate that the synthetic panD RBSs are about four times more 
effective than the native panD RBS in directing translation of the panD rgene mRNA and 
evidence the utility of such synthetic RBSs in enhancing pantothenate production. 
Additional approaches to increasing pantothenate production can include, for example, 
5 increasing the half-life of the panD gene mRNA, increasing the strength of the promoter 
for panD transcription and/or increasing the stability of the PanD protein. 

EXAMPLE VI: Construction of Strains Containing an Integrated Pj 6 panEl 
Cassette without an Antibiotic Resistance Gene. 

10 

Example II describes the identification of the B. subtilis panEl -gene that encodes 
the enzyme responsible for the majority of the ketopantoate reductase activity in B. 
subtilis. PA236 (containing the pAN236 plasmid) produced about twice as much 
pantothenate (2 g/1) as its parent strain, PA221 (1 g/1) in 24 hour SVY test tube cultures. 

1 5 PA236 was presumed to contain an amplified (-3 copies) integrated pAN236 plasmid 
based on selection for tetracycline resistance (the tetR-gene product being encoded on 
the pAN236 plasmid in addition to the P 2 e panEl cassette). Also useful in the 
methodologies of the present invention are strains that contain a single integrated 
unamplifiable copy of P 26 panEl at the panEl locus, for example, without an antibiotic 

20 resistance gene in the strain. Such a strain was generated as follows. 

A plasmid named pAN251 was derived from pAN236 by inserting additional 
chromosomal sequences just upstream and just downstream from the P26p<*nEl cassette. 
These additional sequences, which provide homology to allow integration of the P 2 6 
panEl cassette at panEl by double crossover, were obtained by PCRIrom chromosomal 
25 DN A as a template. pAN25 1 is shown in Figure 1 1 . The nucleotide sequence of 
pAN251 is set forth as SEQ ID NO:«l . 

Next, a strain was constructed which allowed selection for the incoming P 2 6 
panEl cassette. The strain included the following three components: <1 ) P 26 panBCD\ 
(2) ApanEl; and (3) i/vC, since both panEl and ilvC must be mutated to have a Pan" 
30 phenotype. The starting strain was CU550 (trpC2 y ilvC4, leuC124). The P 26 panBCD 
cassette from PA221 chromosomal DNA was introduced in two steps to create strain 
PA290. Next, ApanEl::spec was transformed into PA290, using chromosomal DNA 
from strain PA240, to give strain PA294</r/?C2, ilvC4, leuC124 y P 26 panBCD 9 
ApanEl::spec\ which is a strict pantothenate auxotroph. Finally, PA294 was 
3 5 transformed with plasmid pAN25 1 , selecting for pantothenate prototrophy, to give strain 
PA303. This strain was expected to have the genotype trpC2 9 ilvC4, kuC124, P, 26 
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panBCD, P 2 6 panEl. PA303 was checked for the correct chromosomal structure at the 
panEl locus by PCR using primers that flank the P 2 s insertion just upstream of panEl. 
The PCR product from PA303 was of the expected size, with a concomitant loss of the 
PCR product from the wild type panEl gene, consistent with having obtained the 
5 desired double crossover event. Furthermore, PA303 was tetracycline sensitive, which 1 
is also consistent with the desired double crossover event, as opposed to a Campbell- 
type single crossover of the plasmids into the chromosome. The trp, i/v, and leu 
auxotrophies from the parent strain were all maintained in PA303. 

In 24 hour liquid SVY test tube cultures, PA303 produced almost the same level 
10 of pantothenate as positive control PA236, and about twice as much as PA22 1 , which 
does not contain engineered panEl as indicated in Table 12. 

Table 12. Pantothenate production by 24 hr. test tube cultures ofPA303 and 
controls grown in SVY plus 5 g/l a-KlV and 5 g/l fi-alanine. 

15 



Strain 


OD600 


[pan] g/l 


PA221-1 


10.9 


0.85 


PA221-2 


10.5 


0.85 


PA236-1 


9.5 


1.74 


PA236-2 


9.3 


1.70 


PA303-1 


10.8 


1.66 


PA303-2 


10.7 


1.61 



EXAMPLE VII: Generation of Microorganisms Capable of Producing 
Pantothenate in an a-KIV (or Valine) Independent Manner 

20 a-ketoisovalerate (a-KIV) is a rate limiting intermediate for pantothenate 

production in certain strains deregulated for pantothenate synthesis. Addition of either 
a-KIV or valine at 5 g/l increases pantothenate production about 5-fold in test tube 
cultures with strains such as PA221 . In order to alleviate the need to feed either a-KIV 
or valine, strains were engineered that have an increased capacity to synthesize a-KIV. 

25 a-KIV is produced in B. subtilis from pyruvate by the sequential action of three 

enzymes encoded by four genes, ilvB and ilvN, /7vC, and ilvD. In a wild type B. subtilis, 
three of the genes (ilvB, ilvN, and ilvQ are the first three genes of the large ilv-leu 
operon. The fourth gene necessary for a-KIV synthesis, ilvD, is located by itself 
elsewhere on the chromosome. The B. subtilis ilv-leu operon is thought to be regulated 
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only by leucine levels. Feeding of exogenous leucine reduces transcription of the ilv-leu 
operon by about 13-fold, probably by an attenuation mechanism (Grandoni et al. (1992) 
J. Bacteriol. 174: 3212-3219). The only known feedback regulation in the ilv-leu , 
pathway is the inhibition of the leuA gene product by leucine. 

5 As a first step to deregulate the synthesis of a-KIV, a copy of the i/vBtfC region 

from the wild type B. subtilis ilv-leu operon was isolated by PCR, and installed adjacent 
to the P 26 promoter and RBS2 on a vector, pOLL8, that was designed to integrate a 
single P 26 expression cassette by double recombination at the amyE locus. The amyE 
gene encodes a nonessential a-amylase, and is a useful locus for installing expression 

10 cassettes. The resulting plasmid, pAN267, is illustrated in Figure 12. The nucleotide 
sequence of pAN267 is set forth as SEQ ID NO:82. pAN267 readily gave stable 
transformants by double crossover at the amyE locus of B. subtilis strains, as described 
in detail below. 

Construction of pantothenate overproducing, strains that are leucinejfrototrophs 

1 5 Initially, a B. subtilis strain containing z/vC* and ApanEl was used to introduce a 

single copy of P 26 panEl into the chromosome without using an antibiotic resistance 
gene. The double mutant was required to select for the incoming P 26 panEl cassette 
because a ApanEl mutation alone does not result in pantothenate auxotrophy. A strain 
named CU550 was obtained containing ilvC4 to be used as a basis for this type of strain 

20 construction. However, CU550 also contains a closely linked leuC124 mutation, so all 
strains derived from CU550 required leucine. Having shown that the combination of P26 
panBCD and P 26 panEJ was favorable for pantothenate production, the next step was to 
reassemble this combination of two cassettes in a leucine prototroph. 

Accordingly, the two cassettes were combined in two different strain 
25 backgrounds, RL-1 and PY79. To introduce chromosomal P M panEl into the PY79 and 
RL-1 strain backgrounds without using an antibiotic resistance gene, a strategy was used 
that did not rely on ilvC4. (The strategy took advantage of the observation that the 
ApanEl mutation causes a pantothenate bradytrophy, manifested by relatively small 
colonies on TBAB (rich) plates). First, ApanB.xat and ApanEr.spec were introduced 

30 into both strain backgrounds. Next, the resulting strains were transformed 

simultaneously With DNA from two strains, PA221 (P 26 panBCD) and PA303 (P 26 
panEl), selecting for Pan + on TBAB plates. Colonies of two distinct sizes grew on the 
selective plates, with the larger size-comprising about 2% of the colonies. The larger 
colonies were presumed to represented co-transformants that received -both P 26 panBCD 

35 and P 26 panEl , and that the smaller colonies had received only P 26 panBCD. Consistent 
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with this prediction, the larger colonies had lost both Cam r and Spec r , while the smaller 
colonies had lost only the cat gene, and retained the spec gene. Furthermore, a 
representative derivative of PY79 named PA327, and a representative derivative of RL- 
1 , named PA328, both produced the elevated levels of pantothenate in test tube cultures 
5 which was about 1 .6 to 1 .7 g/1 (Table 1 3). 

Table 13. Pantothenate production of PA327, PA328, and controls from 24 hr 

test tube cultures grown in SVYplus 5 g/l a-KIV and fi-alanine. 

P26P^nEl copy 



10 



Strain 


Background 


number 


{pan] g/l 


PA22I-1 


RL-1 


0 


0.92 


PA221-2 


RL-1 


0 


0.95 


PA236-1 


RL-1 


amplified (-3) 


1.60 


PA236-2 


RL-1 


amplified (-3) , 


1.73 


PA327-I 


PY79 


1 


1.66 


PA327-2 


PY79 


1 


1.65 


PA328-1 


RL-1 


1 


1.61 


PA328-2 


RL-1 


1 


1.91 


Thus, PA327 and PA328 were concluded to contain both P 2 6 panBCD and P 2 6 



panEly and were used for further constructions as described below. PCR analysis 
confirmed the presence of the two cassettes. 



15 Installation of a stable P26 ilvBNC cassette into two lineages of pantothenate 
overproducing strains 

Having constructed PA327 and PA328, derivatives of PY79 and RL-1 that 
contain P26 panBCD and P26 panEl, and that are Leu + , the next step was to introduce 
stable copies of P26 ilvBNC. This was accomplished by transforming PA327 and 
20 PA328 with plasmid pAN267, selecting for Spec r . Screening by PCR showed that about 
85% of the obtained transformants contain P26 ilvBNC integrated at amyE by double 
crossover. One transformant of PA327, named PA340, and one transformant of PA328, 
named PA342, were chosen for further study. 

In test tube cultures grown in SVY medium plus 5 g/l p-alanine but without 
25 added a-KIV, both PA340 and PA342 gave the expected increase in pantothenate 
production over that of PA327 and PA328, to about 1 .3 to 2 g/l (Table 14). 
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Table 14. Pantothenate and valine production by PA340 and PA342, both 

containing P 26 ilvBNC in 24 hr test tube cultures -grown in SVY with 5. g/l 
( ^-alanine and with or without 5 g/l a-KlV 



Strain 


Back- 
ground 


OD600 
-a-KJV 


+ o-KIV 


(pan] g/l 
-a-KIV 


+ a-KIV 


1val]g/l 
-a-KIV 


+ a-KIV 


PA340-1 


PY79 


11.8 


7.1 


2.02 


2.10 


0.38 


0.90 


PA340-2 


PY79 


10.3 


7.5 


1.97 


2.03 


0.40 


0.91 


PA342-1 


RL-I 


10.2 


8.0 


1.29 


1.89 


0.27 


0.78 


PA342-2 


RL-1 


9.6 


9.2 


p 


2.04 


0.21 


0.79 



The two new strains also gave a slight increase in valine secretion, indicating 
that the ilvBNC genes had been deregulated. However, when the same strains were 
grown with 5 g/l a-KIV added, a further increase in pantothenate production occurred 
1 0 from PA342, suggesting that a-KIV was still rate limiting in this strain background. 
Similar results, only with more -growth and hence higher pantothenate levels, were seen 
in shake flask cultures (Table 15). 



Table 15. Pantothenate and valine production by PA340 and PA342, both 

containing P 26 ilvBNC in 24 hour shake flask cultures grown in SVY with 
5 g/l ^-alanine and with or without 5 -g/l a-KIV. 



Strain 


Back- 
ground 


OD600 
- a-KIV 


+ a-KIV 


[pan] g/l 
- a-KIV 


+ a-KJV 


Tva1]g/1 
-a-KIV 


+ a-K!V 


PA327 


PY79 ! 


21 


22 


0.6 


3.0 


0.5 


1.3 


PA340-1 


PY79 


20 


20 


3.5 


4.1 


1.0 


1.9 


PA340-2 


PY79 


22 


19 


3.0 


2.1 


0.8 


1.4 


PA328 


RL-1 


20 


16 


1.4 


2.7 


0.6 


1.3 


PA342-1 


RL-1 


17 


16 


3.3 


3.6 


0.9 


1.6 


PA342-2 


RL-1 


18 


18 


3.1 


4.2 


0.8 


1.4 
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EXAMPLE VIII: Increasing panD Copy Number in Strains Engineered to , 

Overproduce panEl and the UvBNC Gene Products Enhances Pantothenate 

i 

Production 

Experiments where P-alanine was fed to cultures of engineered fl. subtilis strains 
5 consistently showed that P-alanine was a rate limiting intermediate in pantothenate 

synthesis. The effect of adding additional copies of panD on pantothenate production in 
PA340 and PA342 was examined. Strains PA340 and PA342 were transformed with 
chromosomal DNA isolated from PA401 with selection on plates containing 15 ng/ml of 
tetracycline (Tet 15 plates). Transformants derived from each parent were patched onto 
10 Tet 60 plates to identify those which were likely to contain multiple copies of the 
expression cassette. Twelve transformants from each transformation which grew on 
Tet 60 were streaked for single colonies on this medium and then assayed in SVY 
medium test tube cultures for pantothenate production. One trdnsformant from each 
group was found to produce greater than 300 mg/1 pantothenate in 24 hours. These two 
1 5 transformants were saved and named PA404 (PA340 strain background) and PA405 

(PA342 strain background). Both strains were resistant to spectinomycin, indicating that 
the P26 HvBNC expression cassette was still present at amyE. PCR analysis of 
chromosomal DNA isolated from each strain confirmed that the deregulated panEl gene 
had also been retained. 

20 Next, PA404 and PA405 were evaluated in shake flask cultures which were 

grown in SVY medium containing maltose as the carbon source and supplemented with 
various intermediates. The cultures were grown for 24 and 48 hours and then assayed 
for pantothenate, P-alanine, and valine production. The results of this experiment are 
presented in Table 16. Analogous shake flask culture data for the parent strains {PA340 

25 and PA342) are included in the tables for comparison. 

Table 16, Pantothenate production by PA404 and PA405 in shake flask cultures 
after 24 hours 



Strain 


Medium 
Supplements 


OD600 


Pan 

g/l 


p-AJa 

g/l 


Val 

g/l 


PA340 


none 


20 


0.4 


<0.1 


1.0 


PA404 


none 


22 


1.8 


<0.1 


0.7 


PA342 


none 


19 


0.3 


0.2 


0.7 


PA405 


none 


19 


1.4 


0.4 


0.5 
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PA340 P-alanine 5 ] 18 3.6 3.2 0.6 

PA404 p-alanine 5 ™ 2.8 ' 5.1 0.7 

PA342* p-alanine 5 17 3.3 3.3 0.5 

PA405* p-alanine 5 19 1.3 6.5 0.6 

Values are the average of duplicate flasks except where indicated by *. 

In the absence of any medium supplementation, PA404 and PA405 made four to 
five times more pantothenate in 24 hours compared to their isogenic parent strains 
5 (Table 16). The supply of p-alanine was clearly limiting in the parent strains PA340and 
PA342. Addition of amplified P26 panD greatly increased the supply of ^-alanine. 

EXAMPLE IX: Deregulation of the B. subtiiis UvD Gene Enhances Pantothenate 
Production 

1 0 To deregulate expression of the UvD gene, standard procedures {described 

above) were used to integrate the constitutive Pi6 promoter and an artificial ribosome 
binding site, RBS2, just upstream of the UvD coding region. The UvD gene maps by 
itself, unlinked to the UvBNC operon. First, a 2.4 kb region of the RL-1 chromosome 
that contains the UvD coding region and 730 bp of upstream sequence was cloned by 

1 5 PCR into a low copy (about 15 per E. coli cell) vectorx:alled pOKl 2,<to -give plasmid 
pAN257, shown in Figure 13. 

Taking advantage of a natural EcoJtl site just upstream of the native tfvDgene 
promoter, and a natural Nco\ site at the UvD start codon, an artificial sequence 
containing P26 and RBS2 was inserted into pAN257 to give pAN263 (Figure 14). The 

20 nucleotide sequence of pAN263 is set forth as SEQ ID NO:83. In paraHel with this 
construction, the a*/ -gene was also inserted into pAN257, between the -same upstream 
£coRI site and a Bglll site in the middle of the UvD coding region, to give pAN261 , 
which is deleted for a large portion of the UvD gene (Figure 15). 
Using pAN261 and pAN263, the P 76 UvD cassette could then be installed in the B. 

25 subtiiis chromosome in two steps. In the first step, pAN261 is introduced by 

transformation, selecting for chloramphenicol resistance, and then confirming an Uv" 
phenotype. In the second step, pAN263 is introduced, selecting for Hv + , checking for 
chloramphenicol sensitivity, and confirming correct local structure by PCR. 

pAN26 1 was first transformed into strain RL- 1 (highly competent) to give strain 
30 PA343 (AilvD::cat\ and then chromosomal DNA from PA343 was used to transform 
PA340 and PA342 to Ilv auxotrophy, yielding strains named PA348 and PA349, 
respectively. Chromosomal DNA is inherently more ^efficient than monomelic plasmid 
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in transforming B. subtilis. Similarly, pAN263 DNA was transformed into PA343 
(moderately competent) to giye strain PA345 (P 2 6 //vD), and then PA345 chromosomal 
DNA was used to transform PA348 and PA349 to Ilv + prototrophy, yielding strains 
PA374 and PA354, respectively. 

5 As predicted, PA374 and PA354 gave further increases in pantothenate 

production, to about 2.5 to 2.9 g/l, in test tube cultures grown in SVY plus 5 g/l P- 
alanine (Table 17). 

Table J 7. Pantothenate and valine production by PA374 and PA354, containing P 2 e 
1 0 il vD, and controls, in 24 hr test tube cultures grown in SVY with 5 g/l /?- 

alanine and with or without 5 g/l a-KIV 









OD 60 o 


[pan] 
g/l 


[val] 
g/l 


Strain 


Back- 
ground 


ilvD status 


a-KlV 


+ 


a-KJV 


+ 


a-KJV 


+ 


PA340 


PY79 


w.t. 


9.2 


9.0 


2.14 


2.23 


0.38 


0.90 


PA348 


PY79 


ilvD::cat < 


11.7 


10.0 


0.19 


2.23 


0.19 


0.91 


PA374-1 
PA374-2 


PY79 
PY79 


P26 HvD 
P26 HvD 


9.1 
8.2 


7.3 
7.7 


2.93 
2.99 


2.40 
2.36 


0.58 
0.60 


0.87 
0.95 


PA342 


RL-1 


w.t. 


10.2 


8.0 


1.29 


1.89 


0.27 


0.78 


PA349 


RL-1 


ilvD::cal 


8.1 


7.7 


0.17 


1.87 


0.22 


0.88 


PA354-1 
PA354-2 


RL-1 
RL-1 


P26 HvD 
P26 UvD 


9.6 
7.5 


9.6 
8.2 


2.57 
2.48 


2.03 
2.24 


0.65 
0.64 


1.23 
0.97 



In the absence of added P-alanine, strains PA374 and PA354 produced only 
1 5 about 0.2 g/l pantothenate in test tube cultures, indicating that PanD activity is 
significantly rate limiting. 

To alleviate this limitation, the amplifiable P 2 6 panD cassette from strain PA401 
was installed. PA401 chromosomal DNA was transformed into PA374 and PA354, 
selecting for Tet r at 15 mg/1, to yield strains PA377 and PA365, respectively. After 
20 transformants were obtained, the strains were streaked on plates containing 30 and 60 
mg/1 tetracycline to reamplify the copy number of the P 2 6 panD cassette integrated at the 
bpr locus. In test tube cultures grown in SVY without a-KIV or P-alanine, a substantial 
improvement in pantothenate titers over those of PA374 and PA354 was obtained 
(Tables 18 and 19). 



> 
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Table 18. Pantothenate production by PA365, containing amplified P26 panD, and 
controls, in 24 and 36 hr test iube cultures grown in SVY-glucose without 
^-alanine or a-KIV 



1 




OD 600 


[pan] g/l 




Rplpvant fjpnntvnp 










Strain 




24 hrs. 


36 hrs 


24 hrs. 


36 hrs. 


PA342-1-1 


w.t. ilvD 


11.7 


8.8 


b.d. 


0.27 


PA342-1-2 


w.t. ilvD 


12.8 


8.8 


b.d. 


0.26 


PA354-1-1 


P26 ilvD 


n.d. 


11.0 


n.d. 


0.19 


PA354-1-2 


P26 HvD 


n.d. 


8.4 J 


n.d. 


0.20 


PA365-1 


P26 ilvD, P26panD 


9.8 


10.0 


1.01 


2.07 


PA365-2 


P26ilvD,P26panD 


9.9 


10.4 


0.96 


2.09 



5 



' n.d. = not determined; b.d. = below detection 

Table 1 9. Pantothenate production by PA 377, containing amplified panD, and 
controls, in 27 hr test tube cultures grown in SVY-glucose or SVY- 
10 maltose, without a-KIV, and with or without 0-alanine. 







OD 600 


Strain 


Relevant genotype 


-P-ah 
Glucose 


+ P-ala 
Glucose 


- p-ala 
Maltose 


+0-ala 
Maltose 


PA374-1 
PA374-2 


P26 HvD 
P26 HvD 


•9.4 
9.2 


9.8 
9.6 


7.0 
6.6 


6.4 
6.3 


PA377-1 
PA377-2 


P26ilvD. P26panD 
P26'lvD, P26panD 


10.0 
10.5 

[pan] g/l 


7.6 
7.8 


7.2 
9.4 


6.1 
5.4 


Strain 


Relevant genotype 


-P-ala 
Glucose 


+ P-ala 
Glucose 


- P-ala 
Maltose 


+ p-ala 
Maltose 


PA374-1 
PA374-2 


P26 HvD 
P26 ilvD 


0.04 
0.10 


2.76 
2.65 


0.14 
0.15 


1.31 
1.33 


PA377-1 
PA377-2 


P26ilvD,P26panD 
P26ilvD,P26pa»D 


1.25 
1.25 


2.76 
2.35 


1.-26 
1.31 


1.10 
1.26 



In SVY with glucose, an increase in pantothenate production can still be 
1 5 achieved by feeding 5 g/l ^-alanine suggesting that increasing panD expression Turther 
might increase pantothenate production. In SVY with maltose, no further increase in 
pantothenate was obtained by feeding P-alanine -suggesting that ^-alanine and/or 
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aspartate synthesis is suppressed by glucose. Strains PA377 and PA365 have been 
evaluated in 10 liter fermentors, where they typically produce above 20 g/l pantothenate 
in 48 hours without supplemental p-alanine and a-KIV or valine, described in detail 
below. 

5 

EXAMPLE X: 10 liter Fermentations of Pantothenate-Producing Microbes 

Engineering of the P 26 UvBNC and P 26 ilvD cassettes to give strains PA342 and 
PA354 allowed the production of 22 and 26 g/l of pantothenate, respectively, without 
the addition of valine or a-KIV to the fermentation medium (Table 20). At 48 hours, 
1 0 both strains had secreted about 0.5 g/l of valine into the medium. 



Table 20. 10-liter fermentations of five pantothenate overproducing strains. 



Strain 


Medium 


Feed 
40% Glucose 
plus 


OD600 
48 hr 


Valine 
48 hours 
g/l 


p-ala 
48 hr 
g/l 


Pantothenate 
g/L 

36 hr 48 hr 72 hr 


PA 236 


SVYG 


50 g/I p-ala 
25 g/l a-KIV 


108 


added 


. added 


16 


19 


21 


PA 342 


SVYG 


50 g/l p-ala 


92 


0.5 


added 


17 


22 




PA 354 


SVYG 


50 g/l P-ala 


90 


0.5 


added 


19 


26 




PA 365 


SVYG 


25g/l YE 


77 


0.85 


0.4 


18 


21 


27 


PA 377 


SVYG 


25g/l YE 


85 


1.5 


0.5 


18 


22 


31 


PA 377 


PFMG 


25g/lYE 


96 


0.8 


0.4 


19 


25 


29 


PA377 


PFMG 




71 


0.7 


0.1 


16 


21 





15 Pantothenate synthesis in fermentors 

With the addition of the P 26 panD cassette to strains PA354 and PA374 to create 
strains PA365 and PA377, neither p-alanine nor a-KIV needed to be added to the 
fermentors. Strain PA365 produced 21 g/l pantothenate in 48 hours and 27 g/l in 72 
hours with no precursors added to the medium (Table 20). PA377 was somewhat better, 

20 producing 1 8 g/l of pantothenate in 36 hours, 22 g/l in 48 hours, and 3 1 g/l in 72 hours). 
Valine was measured at 0.85 and 1 .5 g/l for strains PA365 and PA377, respectively, at 
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48 hours in SVYG medium. Strain PA377 maintained valine between 1 -1 .5 -g/1 
throughout most of the fermentation and ^-alanine between 0.2 and 0.5 .g/1. 

, Strain PA377 was further evaluated in 10-liter 4ermentors in yeast -extract based 
PFMG medium. Pantothenate yields in PFMG and SVYG medium were similar. In 
5 PFMG, P A377 produced 1 9 g/1 of pantothenate in 36 hours, 25 g/1 in 48 hours, and 29 
g/1 in 72 hours. In SVYG, PA377 produced 18*g/L pantothenate in 36 hours, 22 g/L in 
48 hours and 3 1 g/L in 72 hours (Table 20). 

EXAMPLE XI: Converting Strain PA377 to a Tryptophan Prototroph 

[0 

PA377 (Trp") was transformed to Trp + using chromosomal DNA from PY79 to 

give strain PA824. After re-amplification of the P 2 6panD casette, PA824 was compared 

i 

to PA377 for pantothenate production in test tube cultures grown in SVY glucose with 
or without 5 g/L P-alanine (Table 21). 

15 

Table 2 1 : Trp* derivatives of PA377: Pantothenate production in 48 hour test tube 
cultures grown in SVYgiucose, ±fi-alanine 



Strain 


• 

irpC donor 


ODeoo 


[pan] g/L 


- p-alanine 


+ ^-alanine 


- P-alanine 


+ P-alanine 


PA377-1 


RL-1 


8 


8 


1.5 


3.4 


PA377-2 


RL-1 


8 


9 


1.6 


3.6 


PA824-1 


PY79 


12 


10 


0.7 


3.7 


PA824-2 


PY79 


11 


11 


1.9 


4.9 



20 The Tip+ strains -grew to slightly higher densities than PA377. In the absence of 

exogenous p-alanine, all of the strains produced similar levels of pantothenate, while 
with the addition of P-alanine, the Trp+ derivatives produced somewhat more 
pantothenate. 



25 Fermentor studies with PA824 

PA824 was evaluated in CF3000Chemap 14 liter vessels with 10 liter working 
volumes. Formulations for two of the media used in the fermentors are given in Tables 
22 and 23. 
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Table 22 : Formulation for PFMG-5 medium 



BATCH 





MATERIAL 


g/L (final []) 


1 


Amberex 1003 


10 


2 


Na Glutamate 


5 


3 


(NH4)2S0 4 


8 








4 


MAZU DF 37C 


2.5 








Added After Sterilization and Cool Down 








1 


KH 2 P0 4 


10 


2 


K 2 HP0 4 -3H 2 0 


20 








1 


Glucose 


20 


2 


MgCl 2 -6H 2 0 


1 


3 


CaCl 2 -2H 2 0 


0.1 








1 


Sodium Citrate 


1 


2 


FeS0 4 -7H 2 0 


0.01 


3 


SM-1000X 


1.0 ml 










H 2 0 


qs to 6000 ml 


FEED 




MATERIAL 


g/L 


1 


Glucose 


600 


2 


CaCl 2 -2H 2 0 


0.6 










H 2 0 


qs to 3000 ml 
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Table 23 : Formulation for SVY-4 medium 



BATCH 





MATERIAL 


g/L (final [J) 


1 


Veal Infusion 


25 • 


2 


Yeast Extract 


5 


3 


Na Glutamate 


5 


4 


(NH4)2S04 


' 4 








5 


MAZU DF 37C 


2.5 








Added After Sterilization and Cool Down 








1 


KH 2 P0 4 


10 


2 


K 2 HPCv3H 2 0 


20 








1 


Glucose 1 


20 


2 


MgCI 2 -6H 2 0 


1 


3 


CaCl 2 -2H 2 0 


0.1 








1 


Sodium Citrate 


1 


2 


FeS0v7H 2 0 


0.01 


3 


SM-1000X 


1.0 ml 










H 2 Q 


qs to 6000 ml 


FEED 




MATERIAL 


g/L 


1 


Glucose 


600 


2 


CaCl 2 -2H 2 0 


0.6 










H 2 0 


qs to 3000 ml 



All fermentations were glucose limited fed batch processes. Immediately after 
inoculation, agitation was set at 200 rpm. The initial batched 2% glucose was consumed 
10 during exponential growth. Afterwards, glucose concentrations were maintained 
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between 0.2 and 1.0 g/L by continuous feeding of a 60% glucose solution. Th6 variable 
rate feed pump was computer controlled and linked to the dissolved oxygen 
concentration [p0 2 ] in the tank by an algorithm. When the [p0 2 ]fell to 30%, computer 
control began to automatically adjust the agitation rate to maintain a dissolved oxygen 
5 concentration between 25 and 30% [p0 2 ]. Computer control and data recording were by 
Braun MFCS software. 

In one study, PA284 was grown in fermentors at two temperatures (40°C and 
43 °C) in the medium described in Table 22. Results of two experiments demonstrated 
that the highest pantothenate titers at early time points were produced at 43°C. The cell 

1 0 mass approached 1 50 optical density units at OD 6 oo and 56 hours at 43°C, and the 

pantothenate titers were 21 g/L, 28 g/L and 36 g/L at 36, 48 and 72 hours respectively. 
In the parallel fermentation at 40°C, the cell mass approached 120 optical density units 
at OD 6 oo and 56 hours, and the pantothenate titers were 18 g/L, 26 g/L and 37 g/L at 36, 
48 and 72 hours, respectively. 

1 5 In another study, PA824 was grown in a fermentor at 43°C in the medium 

described in Table 23. The cell mass exceeded 160 optical density units at OD 6 oo and 36 
hours, and the pantothenate titers were 23 g/L, 34 g/L, 37 g/L and 40 g/L at 24, 36, 48 
and 60 hours, respectively. In other fermentations, increasing the amount of trace 
elements in the glucose feed (e.g., increasing the concentration of SM from IX to 2X) 

20 resulted in even higher titers of pantothenate. 

EXAMPLE XII: Identification and characterization of the B. subtilis coaA gene 
product 

The annotated version of the B. subtilis genome sequence available on the 
25 "Subtilist" web site contains no gene labeled as coaA. However a homology search 
using the protein sequence of E. coli pantothenate kinase as a query sequence gave a 
good match with B. subtilis gene yqjS, which is annotated as "unknown; similar to 
pantothenate kinase." This gene appears to be the penultimate gene in an operon 
containing five open reading frames (Figure 18). Two of the open reading frames 
30 encode proteins which are similar to D-serine dehydratase and to "ketoacyl reductase"; 
the other two have no known homologies. For the open reading frame corresponding to 
coaA, there are three possible start codons; each having a possible ribosome-binding site 
(RBS) associated with it. The three potential coaA ORFs were named coaAl, coaA2, 
and coaA3, from longest to shortest. 
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All three potential coaA open reading frames were cloned along with their 
respective RBSs by PCR followed by ligation into expression plasmid pAN229. 
, pAN229 is a low copy vector in £ coli that provides expression from the SP01 phage 
Pis promoter and can integrate by single crossover at bpr with tetracycline selection. A 
5 representative resulting plasmid, pAN28 1 , is shown in Figure 1 9. • 

To determine if the cloned putative coaA ORFs actually encode a pantothenate 
kinase activity, several isolates of all three plasmids were transformed into the E. coli 
strain YH1, that contains the coaA15(Ts) allele. Transformants were streaked to plates 
incubated at 30° and 43°C to test for complementation of the temperature sensitive 
1 0 allele. All isolates of all three coaA variants, except for one isolate of pAN282, 

complemented well at 43°C, indicating that all three plasmid constructs encode an active 
pantothenate kinase. Accordingly, it can be concluded that the B. subtilis yqjS open 
reading frame codes for an active pantothenate kinase. 

1 5 EXAMPLE XIII: Deletion of the coaA gene from the B. subtilis genome 

The coaA gene of B. subtilis (yqjS) was deleted from the chromosome of a B. 
subtilis strain by conventional means. The majority of the coaA coding sequence was 
deleted from a plasmid clone and replaced by a chloramphenicol resistance gene (cat) y 
while leaving approximately 1 kb of upstream and downstream sequence to allow 

20 homologous recombination within the chromosome, to give plasmid pAN296 (see 

Figure 17). pAN296 was then used to transform a B. subtilis strain (PY79), selecting for 
chloramphenicol resistance. The majority of transformants result from a double 
crossover event that effectively substitutes the cat gene for the coaA gene. The 
transformed strain containing the coaA deletion -<at insertion grew normally due the 

25 presence of a second B. subtilis pantothenate kinase encoding gene described herein. 

EXAMPLE XIV: Identification and characterization of a second B. subtilis gene 
encoding pantothenate kinase activity 

As described in detail in the instant specification, in order to maximize 
30 pantothenate production, it is necessary to restrict the flow of pantothenate toward 
Coenzyme A (CoA), for example, by reducing the activity of pantothenate kinase, the 
first enzyme in the pathway from pantothenate to CoA. After Finding that deletion of 
the coaA gene from the chromosome of B. subtilis is not a lethal event (see Example 
XIII), it was concluded that B. subtilis must contain a second gene that encodes an active 
35 pantothenate kinase, since pantothenate kinase is an-essential enzyme activity. 
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A second pantothenate kinase-encoding gene was identified by complementing 
the E. coli strain YH1 (coaA15(Ts)) with a B. subtilis gene bank and selecting for 
transformants that were able to grow at 43°C. Found among the transformants were two 
families of plasmids that had overlapping restriction maps within each family, but not 
5 between the families. As expected, the restriction map of one family was identical to 
that predicted from the B, subtilis genome sequence for the homologue of the E. coli 
coaA gene (which we named coaA also, see above) and surrounding sequences. The 
other family had a restriction map that was completely non-overlapping with the first. 

DNA sequencing of the ends of the cloned inserts from the second family 
1 0 showed that the clones came from a region of the B. subtilis chromosome that includes 
the 3* end of the ftsU gene, the 5' end of the sul gene, and all of \heyacB, yacC, yacD 9 
cysK, pabB, pabA and pabC genes. None of the open reading frames of these cloned 
inserts showed homology to any known pantothenate kinase sequences, either 
prokaryotic or eukaryotic. 

15 Several deletions were created through the B. subtilis genomic sequences in the 

cloned inserts. Each deletion was tested for complementation of the E coli temperature 
sensitive pantothenate kinase. In particular, a deletion that removed all DNA between a 
Stu I site in the cloning vector and a Swa I site in the yacC gene, leaves yacB as the only 
intact open reading frame in the cloned insert (see Figure 21). This deleted plasmid still 

20 complemented the E. coli pantothenate kinase mutant. However, another deletion that 
removed DNA from the Swa I site in yacC through a Bstl 1071 site in the (already 
truncated) fisH gene, could not complement the E. coli pantothenate kinase mutant. 
From these results, it was concluded that iheyacB open reading frame was responsible 
for the complementation activity. To confirm that yacB is a pantothenate kinase gene, 

25 the yacB ORF plus 1 12 base pairs of downstream flanking sequence was amplified by 
PCR in two independent reactions and cloned downstream of a constitutive promote to 
give plasmids pAN341 and pAN342 (Figure 22). Both pAN341 and pAN342 
complemented the defect in YH1 at 44°C, while a control plasmid, which has the same 
backbone, but expresses panBCD instead of yacB did not. This confirmed that IheyacB 

30 open reading frame was responsible for the complementation of YH 1 . 

As such, a novel gene that encodes pantothenate kinase activity in B. subtilis has 
been discovered that is not related by homology to any previously known pantothenate 
kinase gene. This gene has been renamed coaX t as a second, alternative gene that 
encodes an enzyme that catalyzes the first step in the pathway from pantothenate to 
35 CoaA. Deletion of coaX by methods described above for deleting coaA, in conjunction 
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with reduction in the activity of the CoaA enzyme, provides a means to reduce 
pantothenate kinase activity to the desired level. 

Several homologues of the B. subtilis coaX gene were identified by homology 
searching of various publically available databases using the published yacB (coaX) 

5 open reading frame sequence and predicted amino acid sequence<as set forth in SEQ ID 
NOs:84 and 85 respectively). In two cases (Mycobacterium tuberculosis and 
Streptomyces coelicolor) the homologous coaX genes are adjacent to, or almost adjacent 
to, pantothenate biosynthetic genes, consistent with these homologs having a role in 
pantothenate metabolism. The CoaX proteins show no homology to the CoaA family of 

1 0 pantothenate kinases, nor to the eukaryotic family of pantothenate kinases exemplified 
by PanK of Saccharomyces cerevisiae. , 

Alignment of the amino acid sequences of several bacterial CoaX homologs with 
the amino acid sequence predicted from translating the B. subtilis yacB ORF described 
in the published 5. subtilis genome sequence revealed that the CoaX proteins from other 
1 5 bacteria contained additional amino acid residues at their carboxy-terminal ends. 
Moreover, these extensions beyond the end of the predicted amino acid for the B. 
subtilis gene product contained two relatively well conserved segments of sequence. 

Translation of nucleotide sequences just downstream from the stop codon of the 
B. subtilis yacB ORF in a different reading frame revealed the existence of amino acid 
20 sequences very similar to the carboxy-terminal extensions of the other bacterial CoaX 
proteins. It is thus believed that an error exists in the published DNA sequence of the B. 
subtilis yacB ORF sequence that causes a frame shift leading to an artifactual 
downstream amino acid sequence and premature termination. 

The PCR-generated sequences of A subtilis CoaX in pAN341 and pAN342 
25 (described above) contain enough downstream flanking sequence to encode the putative 
carboxy-terminal extension described above, which is consistent with the result that the 
clones were functional in the complementation assay. However when the 3' PCR 
primer was positioned to include only the shorter yacB ORF predicted from the 
published sequence, but not to include the putative carboxy-terminal extension, then the 
30 resulting plasmids, pAN329 and pAN330 (similar in structure to pAN341 and pAN342; 
see Figure 22), did not complement the defect in YH1 . This result -supports the notion 
that the published yacB coding sequence contains a frame-shift error, and that the 
carboxy-terminal end of CoaX is necessary for pantothenate kinase activity. The 
predicted correct nucleotide sequence for B. subtilis coaX is set forth as SEQ ID NO: 19 
35 and the translated amino acid sequence is set forth as SEQ ID NO:9. A multiple 
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sequence alignment of the CoaX amino acid sequences of B. subtilis and 1 1 homologues 
thereof is set forth in Figure 23 . 

EXAMPLE XV: Generation of mutant coaA genes encoding pantothenate kinase 
5 having reduced or temperature sensitive activities 

This Example describes strategies for modifying the coaA gene (£*., by 
introducing point mutations) to reduce the activity of pantothenate kinase after coaX is 
deleted from the genome. 

10 Cloning and sequencing of the temperature sensitive allele of the E. coli coaA gene. 

Two E. coli strains, each exhibiting a different mutant CoaA phenotype, were 
obtained from the E. coli Genetic Stock Center. Strain DV62 contains the coaA15(Ts) 
allele, and DV79 contains the coaA16(Fr) mutation. DV62 is temperature sensitive at 
43 °C and produces a pantothenate kinase that is temperature sensitive. DV79 was 

1 5 obtained by reversion of DV62 to temperature resistance, and it produces a temperature 
stable, feedback resistant pantothenate kinase activity. Since the DNA sequences of 
these alleles are not available in the literature, the coaA genes from the two mutant 
strains were cloned by PCR and sequenced, in addition to a coaA gene from a strain that 
is wild type at the coaA locus, MM294. The PCR primer at the 5' end was designed to 

20 include the start codon plus four bases upstream, and added an arbitrarily chosen 
ribosome binding site (RBS). The three PCR generated fragments were each ligated 
between theXbaland BamHI sites of pAN229 to give pAN284 (from coaA15(Ts)\ 
pAN285 (from wild type coaA), and pAN286 (from coaA16(Fr)). pAN229 is a low 
copy E. coli vector that provides expression from the P i5 promoter and that can integrate 

25 by single crossover at bpr in B. subtilis with tetracycline selection. 

All three plasmids were transformed into the £. coli strain YH1 for 
complementation testing. All three plasmids complemented the temperature sensitive 
coaA mutation in E. coli YH1 . It is presumed that the coaA15(Ts) gene in pAN284 is 
probably significantly overexpressed relative to the normal chromosomal gene, such that 
30 the overproduction compensates for the temperature sensitive defect. Complementation 
of a defect by overproduction is a well-documented phenomenon in E. coli. 

The coaA coding regions from pAN284, 285, and 286 were subcloned into 
pGEM7 to give pAN306, 307, and 308, respectively, for DNA sequencing. As 
expected, the DNA sequence of the insert in pAN307 (from wild type coaA) matched 
35 the coaA sequence from the £. coli genome database (GenBank™). The sequence from 
pAN306 contains a single base change that causes a SI 76L substitution (i.e., a Ser -> 
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Leu substitution in the amino acid sequence set forth as SEQ ID NO:2). Interestingly, 
the DNA sequence of the pAN308 insert, derived from the feedback resistant strain, was 
identical to that derived from its temperature sensitive parent (represented in pAN3,06). 
This is in accord with the genetic data that indicates that the reversion of the temperature 
5 sensitive mutation occurred at a second site unlinked to the coaA gene. 

The S176L mutation, predicted to cause the temperature sensitive defect in E. 
coli pantothenate kinase, changed a serine residue that is conserved in all known or 
suspected bacterial coaA encoded pantothenate kinases, including that of B. subtil is {see 
SEQ ID NO:3 and refer to alignment). Based on this, a serine to leucine change at the 

10 homologous residue in the B. subtilis pantothenate kinase is predicted to result in either 
a temperature sensitive enzyme or one which is less active. Accordingly, to produce a 
mutant B. subtilis coaA gene, this specific change was introduced into the B. subtilis 
coaA gene. The mutant version is installed in the chromosome of a ft subtilis strain 
deleted for coaX, for example, and the recombinant microorganism is checked for 

1 5 temperature sensitivity (e.g. , reduced growth at 43°C). The mutation is then installed 
into a pantothenate overproducing strain, preferably a strain deleted for the above 
mentioned coaX gene by standard methods to give strains favorable for pantothenate 
production in ft subtilis , i.e, a strain that has reduced pantothenate kinase activity under 
typical fermentation conditions. 

20 

Additional coaA point mutations resulting in reduced pantothenate kinase activity 

Of course it is expected that many other point mutations or combinations of more 
than one point mutation in B. subtilis coqA will also lead to reduced activity. 
Appropriate mutations can be generated by mutagenic polymerase chain reaction and in 
25 vitro recombination, and identified by screening for alleles that poorly complement the 
E. coli coaA15(Ts) mutant. An example ofsuch a mutation of this type is a tyrosine to 
histidine substitution at amino acid 181 of B. subtilis coaA 9 generated by mutagenic 
polymerase chain reaction (see SEQ ID NO:3 and first line of the alignment of Figure 
24). 

30 Isolate pAN282A was derived from the middle-sized B. subtilis coaA open 

reading frame described in Example XII. pAN282A complemented the £ coli 
coaA15(Ts) mutant very poorly, but nonetheless at a level that was detectable above 
background. As was done for the E. coli coaA clones, the open reading frame from 
pAN282A was subcloned into pGEM7 to give pAN303. The DNA sequence of the 

35 insert in pAN303 showed a single base change that led to a tyrosine to histidine amino 
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acid change at the tyrosine corresponding to Yl 81 of SEQ ID NO:3. This tyrosine 
residue is conserved in all bacterial coaA genes/homologues present in GenBank (Figure 
24). This tyrosine residue and the serine that is altered in the K coli temperature 
( sensitive pantothenate kinase described above are separated by only three amino acid 
5 residues in a region which is highly conserved in bacterial pantothenate kinases whereas 
the DNA sequence of a second isolate of the middle-sized open reading frame, from 
pAN282B, was identical to the wild type sequence from the B. subtilis genome 
sequencing project. The single base change found in pAN303 probably occurred during 
PCR amplification of the coaA gene. If this variant of coaA2 has sufficient residual 
1 0 biological activity in B. subtilis, it may be useful in the future for providing reduced 
pantothenate kinase activity. 

A preferred plasmid that can serve as a basis for mutagenizing the coaA open 
reading frame is pAN294 (see e.g., Figure 25 and Example XII). Briefly, mutagenic 
PCR is performed using pAN294 as a template and variants of coaA having reduced 

1 5 pantothenate kinase activity are screened as described above. Alternatively, mutations 
such as the one isolated in pAN282A can be installed into pAN294. The desired 
mutation is then introduced into the chromosome of a B. subtilis strain by transformation 
with the appropriate pAN294 derivative and selected for chloramphenicol resistance at 5 
mg/L. Among the resulting transformants will be isolates that contain the desired 

20 mutation. 

In a similar fashion, mutations that reduce the activity of the CoaX enzyme can 
be generated and identified, and such mutations used for optimizing pantothenate 
production by reducing CoA production as described above. 

25 EXAMPLE XVI: Deleting the second pantothenate kinase gene, coaX gene from 
B. subtilis 

With the knowledge gained above concerning the existence and nature of coaX, 
one can create a deletion of the coaX open reading frame from the B. subtilis 
chromosome that will remove the encoded activity, and that will not adversely affect the 
30 expression of the genes downstream from coaX. In such a deleted strain, the coaA gene 
will be the only gene that encodes pantothenate kinase. 
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To delete the coaXgene from B. subtilis, plasmid pAN336<SEQ ID NO:92), 
which contains upstream and downstream homology for double -crossover, was 
constructed with a kanamycin resistance gene replacing most of the coaX ORF (Figure 
26). Strain PY79 was transformed to kanamycin resistance by pAN336, and an isolate 

5 confirmed to have resulted from a double crossover by PCR wa$ named PA876. As 
predicted, deletion of coaX by itself is not lethal for B. subtilis. Furthermore, 
chromosomal DNA from PA876 would not transform competent PA861 (PY79 JcoaA 
::cat) to kanamycin resistance. These results indicate that it is the combination of 
. AcoaAv.cat and AcoaX :: kan that is lethal for B. subtilis, confirming that B. subtilis 

10 contains two unlinked ^genes that encode pantothenate kanase, coaA and coaX, and that 
either gene alone is capable of supplying sufficient pantothenate kinase for a normal rate 
of growth. 

EXAMPLE XVII: Construction of a plasmid designed to allow directed 
1 5 mutagenesis of the B. subtilis coaA gene 

In order to easily introduce mutated coaA genes into the B. subtilis chromosome, 
it was necessary to install an antibiotic resistance gene adjacent to the coaA gene. This 
was accomplished by joining together in the vector pGEMS three DNA fragments: (1) a 
3.4 kb DNA sequence containing 2.5 kb of genomic sequence upstream from coaA and 

20 the coaA open reading frame(s); (2) a 1 .1 kb DNA sequence containing a 

chloramphenicol resistance -gene (cat); and (3) a 1 .4 kb DNA sequence comprising a 
region downstream from the operon that contains coaA. The resulting plasmid, named 
pAN294, effectively replaces the open reading frame yqjT (the open reading frame just 
downstream from coaA) with the cat gene, with enough homology flanking both sides of 

25 the cat gene to allow double recombination into the B. subtilis chromosome (Figure 25). 
pAN294 was transformed into B. subtilis strain P Y79, selecting for chloramphenicol 
resistance at 5 mg/1 to give strains PA836 and PA837, which are presumably identical. 
PA836 and 837 were checked by diagnostic PCR to show that the cat gene had 
integrated by double crossover, as opposed to single crossover. PA836 and PA837rgrow 

30 normally, leading to the conclusion that the open reading frame yqjT is not essential 
(i.e., the yqjT open reading frame could be deleted from strains PA836 and PA837 with 
no significant effect on growth or pantothenate production). Thus, variant alleles (/.e., 
mutations) of the coaA gane can be introduced into pAN294 and the resulting plasmids 
can be used to introduce the variant alleles into the chromosome of, for example, a B. 

35 subtilis strain. 
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EXAMPLE XVIII: Generation of mutant coaX genes encoding pantothenate 
kinase having reduced or temperature sensitive activities 

Mutant coaX genes are generated by introducing point mutations into the gene 
• and testing the resulting mutants for the ability to complement the £ coli YH1 strain as 
5 described in Example XII. Preferred mutations in the coaX gene seqeunces are those 
that encode a substitution of a residue conserved among CoaX sequences from a variety 
of bacterial sources (e.g., a conserved residue set forth in Figure 23). Alternatively, 
random mutations in the coaXgene sequence are generated by mutagenic PCR and in 
vitro recombination and identified by screening for alleles that poorly complement the 
1 0 E. coli coaA15(Ts) mutant. 

Mutants so generated {i.e., mutants having reduced coaX activity) can be further 
engineered such that the endogenous coaA gene is deleted (as described in Example 
XIII). CoaX reduced-activity mutants can also be further engineered to contain reduced- 
activity CoaA gene products as described in Example XV. 

15 

EXAMPLE XIX: Enhanced Production of Panto-Compounds Using Bacteria 
Having Deletions in One or More Pantothenate Biosynthetic Enzymes 

If the desired panto-compound is not pantothenate, then an appropriate deletion 

20 of one or more of the pantothenate biosynthetic genes from a pantothenate 

overproducing strain will provide a strain that produces said desired panto-compound. 
In this example, the desired panto-compound is pamoate. Starting with, for example, 
strain PA236, PA3 13 or PA824 either one or both of the panC and panD genes is 
deleted. In another example, ketopantoate is the desired panto-compound. Starting 

25 with, for example strain PA244, PA245 or PA824 one, two or all of the i/vC, panEl, 
panC and panD genes are deleted from the starting strain. If p-alanine is the desired 
panto-compound, then panB and panC can be deleted, preferably in a fashion that leaves 
an in frame fusion of a small portion of the 5' end of panB with a small portion of the 
3' end ofpawC, from the strain PA221, PA235, PA245, or PA313. In all of the above- 

30 mentioned examples, the panto-compound producing strain will be a pantothenate 
auxotroph. Accordingly, the growth medium requires sufficient pantothenate for 
adequate growth. Vectors designed to overexpress panD as described above are then 
transformed into the above strains to further enhance P-alanine production. 

The above-mentioned deletions are accomplished by methods well-known to 

35 those skilled in the art, for example, by insertion of an antibiotic resistance gene and 
removing sufficient sequence from the target gene(s) to inactivate said target gene(s). 
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Alternatively, removal of targeted sequences is accomplished without simultaneous 
introduction of an antibiotic resistance gene in said target gene and then introduced by 
congression (co-transformation with any other appropriate selectable DNA sequence) 
followed by screening for the loss of function of said target -gene by replica plating. 



Table 24 : Strains (and corresponding phenotypes) for panto-compound product ion 

bpr locus Parent 



Nsmc 1 


PhpnA 

menu 


urug 






type 


resist. 


locus 


PA221 


Trp- 




P26panBCD 


PA222 






PispanBCD 


PA235 






P26panBCD 








*26 pono l~ L/ 


PA327 


Tip- 




P26panBCD 


PA328 


Trp- 




P26panBCD 


PA340 


Trp- 


Spc 


P26panBCD 


PA342 


Trp- 


Spc 


P26panBCD 


PA354 


Trp- 


Spc 


P26panBCD 


PA365 


Trp- 


Spc, 


P26panBCD 






Tet 




PA374 


Trp- 


Spc 


P26panBCD 


PA377 


Trp- 


Spc, 


P26panBCD 






Tet 




PA40I 


Trp- 




P26panBCD 


PA402 


Tip- 




P26panBCD 


PA403 


Tip- 




P26panBCD 


PA404 


Trp- 


Spc, 


P26panBCD 






Tet 




PA405 


Trp- 


Spc, 


P26panBCD 






Tet 




PA65I 


Trp- 


Spc 


P26panBC*D 


PA284 




Spc, 


P26panBCD 



panE locus iivD amyE locus 
locus 



RL-I 



P 26 panEl 


• 




PA221 


P26panE! 






PA221 


P26panEI 






PA235 


P26panEI 


P24ihBNC 




PA327 


P26panEl 


P26ilvBNC 




PA328 


P26panEl 


P26ilvD P26HvBNC 




PA342 


P26panEl 


P26ilvD P26ilvBNC 


P26panD423 


PA354 


P26panEI 


P26ilvD P26iivBNC 




PA340 


P26panEl 


P26ilvD P26ilvBNC 


P26panD423 


PA374 






P26panD423 


PA22I 






P26panD428 


PA221 






P26panD429 


PA22I 


P26panE! 


P26ilvBNC 


P26panD423 


PA340 


P26partEI 


P26ilvBNC 


P26panD423 


PA342 


P26panEI P26ilvD P26ilvBNC 




PA374 


P26panEl P26ilvD P26iivBNC 


P26panD423 


PA377 



Tet 



Equivalents Those skilled in the art will recognize, or be able to ascertain using no 
10 more than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be^ncompassed by the 
following claims. 
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What is claimed: 

1 . A method of producing a panto-compound comprising culturing a 

, microorganism which overexpresses at least one Bacillus pantothenate biosynthetic 
5 enzyme under conditions such that the panto-compound is produced. 

2. The method of claim 1 , wherein the microorganism overexpresses at least 
one Bacillus subtilis pantothenate biosynthetic enzyme. 

10 3. The method of claim 1 or 2, wherein the pantothenate biosynthetic 

enzyme is selected from the group consisting of ketopantoate hydroxymethyltransferase, 
pantothenate synthetase, aspartate-a-decarboxylase and ketopantoate reductase. 

4. The method of any one of claims 1 to 3, wherein the microorganism 
1 5 overexpresses at least two pantothenate biosynthetic enzymes. 

5. The method of any one of claims 1 to 3, wherein the microorganism 
overexpresses at least three pantothenate biosynthetic enzymes. 

20 6. The method of any one of claims 1 to 5, wherein the panto-compound is 

selected from the group consisting of pantothenate, pantoate, ketopantoate and (3- 
alanine. 

7. A method of producing a panto-compound comprising culturing a 

25 ketopantoate reductase-overexpressing (KPAR-O) microorganism under conditions such 
that the panto-compound is produced. 

8. The method of claim 7, wherein the panto-compound is pantothenate or 
pantoate. 

30 

9. The method of claim 7 or 8, wherein the ketopantoate reductase is 
bacterial-derived. 



35 



1 0. The method of claim 7 or 8, wherein the ketopantoate reductase is 
derived from Bacillus. 
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11. The method of claim 7 or 8, wherein the ketopantoate reductase is 
derived from Bacillus subtilis. 

* 

12. The method of any one of claims 7 to 1 1, wherein theiCPAR-0 

5 microorganism further overexpresses at least one pantothenate biosynthetic enzyme in 
addition to overexpressing ketopantoate reductase. 

13. The method of claim 1 2, wherein the KPAR-O microorganism further 
overexpresses at least one of ketopantoate hydroxymethyltransferase, pantothenate 

1 0 synthetase and aspartate-a -decarboxylase. 

14. A method of producing pantothenate in a manner independent of 
precursor feed comprising culturing an aspartate-a-decarboxylase-overexpressing (AaD- 
O) microorganism having a deregulated isoleucine-valine (//v) pathway under conditions 

15 such that pantothenate is produced. 

1 5. A method of producing at least 2 .g/L pantothenate in a manner 
independent of aspartate or p-alanine feed comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that 

20 pantothenate is produced. 

1 6. A method of producing at least 2 g/L pantothenate in a manner 
independent of valine or a-ketoisovalerate feed comprising culturing a microorganism 
having a deregulated isoleucine-valine (/7v) biosynthetic pathway under conditions such 

25 that pantothenate is produced. 

17. A method of producing at least 30 g/L pantothenate in a manner 
independent of aspartate or P-alanine feed comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that 

30 pantothenate is produced. 

18. A method of producing at least 30vg/L pantothenate in a manner 
independent of valine or a-ketoisovalerate feed comprising culturing a microorganism 
having a deregulated isoleucine-valine (i/v) biosynthetic pathway under conditions such 

35 that pantothenate is produced. 
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1 9. A p-alanine independent high yield production method for producing 
pantothenate comprising culturing a manipulated microorganism under conditions such 
that pantothenate is produced at a significantly high yield. 

5 20. The method of any one of claims 14 to 19, wherein the microorganism ' 

overexpresses acetohydroxyacid synthetase or is transformed with a vector comprising 
an ilvBN nucleic acid sequence or an alsS sequence. 

2 1 . The method of any one of claims 1 4 to 1 9, wherein the microorganism 
1 0 overexpresses acetohydroxyacid isomeroreductase or is transformed with a vector 

comprising an ilvC nucleic acid sequence. 

22. The method of any one of claims 14 to 19, wherein the microorganism 
overexpresses dihydroxyacid dehydratase or is transformed with a vector comprising an 

1 5 ilvD nucleic acid sequence. 

23. The method of any one of claims 19 to 22, wherein the microorganism 
overexpresses aspartate-a-decarboxylase or is transformed with a vector comprising a 
panD nucleic acid sequence. 

20 

24. The method of any one of claims 14 to 23, wherein the microorganism 
further has a deregulated pantothenate biosynthetic pathway. 

25. The method of any one of claims 14 to 24, wherein the microorganism 
25 further has at least one mutant gene selected from the group consisting of a mutant avtA 

gene, a mutant ilvE gene, a mutant ansB gene and a mutant alsD gene. 

26. The method of claim 24, wherein the microorganism overexpresses any 
of ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate 

30 synthetase and aspartate-a-decarboxylase. 

27. The method of claim 24 or 26, wherein the microorganism is transformed 
with a vector comprising a panBCD nucleic acid sequence or a vector comprising a 
panEl nucleic acid sequence. 

35 
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28. The method of any one of claims 14 to 1 6 and 1 9 to 27, wherein 
pantothenate is produced at a level selected from the group consisting of a level-greater 
, than 1 Og/L, a level greater than 20g/L and a level greater than 40g/L. 

5 29. The method of claim 20, wherein the microorganism- overexpresses 

acetohydroxyacid synthetase derived from Bacillus or is transformed with a vector 
comprising an ilvBN nucleic acid sequence or an alsS nucleic acid sequence derived 
from Bacillus. 

1 o 30. The method of claim 21 , wherein the microorganism overexpresses 

acetohydroxyacid isomeroreductase derived from Bacillus or is transformed with a 
vector comprising an ilvC nucleic acid sequence derived from Bacillus. 

t 

3 1 . The method of claim 22, wherein the microorganism overexpresses 
1 5 dihydroxyacid dehydratase derived from Bacillus or is transformed with a vector 

comprising av ilvD nucleic acid sequence derived from Bacillus. 

32. The method of claim 23, wherein the microorganism overexpresses 
aspartate-a-decarboxylase derived from Bacillus or is transformed with a vector 

20 comprising zpanD nucleic acid sequence derived from Bacillus. 

33. The method of claim 24 or 26, wherein the microorganism overexpresses 
any of ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate 
synthetase and aspartate-a-decarboxylase derived from Bacillus. 

25 

34. The method of claim 27, wherein the vector comprises a panBCD nucleic 
acid sequence or a panEl nucleic acid sequence derived from Bacillus. 

35. A method of producing a panto-compound comprising contacting a 
30 composition comprising at least one pantothenate biosynthesis pathway precursor or 

isoleucine-valine biosynthesis pathway precursor with at least one isolated Bacillus 
enzyme selected from the group consisting of ketopantoate hydroxymethyltransferase, 
ketopantoate reductase, pantothenate synthetase and aspartate-a-decarboxylase, under 
conditions such that the panto-compound is produced. 

35 
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36. A method of producing P-alanine comprising culturing an aspartate-a- 
decarboxylase-overexpres$ing (AaD-O) microorganism under conditions such that p- 
alanine is produced. 

5 37. The method of claim 36, wherein the AaD-0 microorganism has a 

mutation in a nucleic acid sequence encoding a pantothenate biosynthetic enzyme 
selected from the group consisting of ketopantoate hydroxymethyltransferase, 
ketopantoate reductase and pantothenate synthetase. 

10 38. A method of producing p-alanine comprising contacting a composition 

comprising aspartate with an isolated Bacillus aspartate-a-decarboxy 1 ase enzyme under 
conditions such that P-alanine is produced. 

39. A method for enhancing production of a panto-compound comprising 

1 5 culturing a mutant microorganism having a mutant coaX gene under conditions such that 
the panto-compound production is enhanced. 

40. The method of claim 39, wherein said recombinant microorganism has a 
mutant coaA gene. 

20 

41 . A method of producing a panto-compound comprising a pantothenate 
kinase mutant microorganism under conditions such that the panto-compound is 
produced at a significantly high yield. 

25 42. The method of claim 4 1 , wherein said mutant microorganism has a 

mutant coaA gene. 

43. The method of claim 41, wherein said mutant microorganism has a 
mutant coaX gene. 

30 

44. The method of claim 4 1 , where said mutant microorganism has a mutant 
coaA and coaX gene. 

45. The method of any one of claims 39 to 44, wherein said panto-compound 
35 is selected from the group consisting of ketopantoate, pamoate or pantothenate. 
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46. The method of any one of claims 39 to 44; wherein said panto-compound 
is pantothenate. 

i 

47. The method of any one of claims 39 to 44, wherein said panto-compound 
5 is produced at a level selected from the group-consisting of a level greater than 1 Og/L, a 

level greater than 20g/L and a level greater than 40g/L. 

48. The method of any one of claims 39 to 44, wherein said recombinant 
microorganism further has a deregulated pantothenate biosynthetic pathway or further 

10 has a deregulated isoleucine-valine (/7v) biosynthetic pathway. 

49. The method of claim any one of claims 39 to 44, wherein said 
recombinant microorganism further overexpresses panD and panE. 

1 5 so. The method of any one of claims 39 to 44, wherein said recombinant 

, microorganism further has at least one mutant gene selected from the group consisting 
of a mutant avtA gene, a mutant /7v£gene, a mutant ansB gene and a mutant alsD gene. 

51. A method for enhancing production of a panto-compound comprising 

20 culturing a microorganism that has a deregulated pantothenate biosynthetic pathway and 
that also has a mutation that results in reduced pantothenate kinase activity under 
conditions such that the panto-compound production isenhaneed. 

52. A method for identifying compounds which modulate pantothenate 

25 kinase activity comprising contacting a recombinantcell expressing pantothenate kinase 
encoded by the coa^gene with a test compound and determining the ability of the test 
compound to modulate pantothenate kinase activity in said cell. 

53. The method of claim 52, wherein said cell further comprises a mutant 
30 coaA gene encoding a pantothenate kinase having reduced activity. 

54. The method of any one of claims 1 to 5 1 , wherein the microorganism is 
Gram positive. 

35 55. The method of any one of claims 1 to 51, wherein the microorganism is 

Gram negative. 
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56. The method of any one of claims 1 to 5 1 , wherein the microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci and Streptomyces. 

5 57. The method of any one of claims 1 to 5 1 and 54 to 56, wherein the 

microorganism is of the genus Bacillus. 

58. The method of any one of claims 1 to 5 1 and 54 to 57, wherein the 
microorganism is Bacillus subtilis. 

10 

59. The method of any one of claims 1 to 13, 35, 39 to 51 and 54 to 58, 
further comprising recovering the panto-compound. 

60. The method of any one of claims 14 to 34 and 54 to 58, further 
1 5 comprising recovering the pantothenate. 

61 . The method of any one of claims 1 to 14, 35 , 39 to 46, 48 to 5 1 and 54 to 
59, wherein the panto-compound is produced at a level greater than 2 g/L. 

20 62. A recombinant microorganism which overexpresses at least one Bacillus 

pantothenate biosynthetic enzyme. 

63. The recombinant microorganism of claim 62, which overexpresses at 
least one Bacillus subtilis pantothenate biosynthetic enzyme. 

64. The recombinant microorganism of claim 62 or 63, wherein the 
pantothenate biosynthetic enzyme is selected from the group consisting of ketopantoate 
hydroxymethyltransferase, pantothenate synthetase, aspartate-a-decarboxylase and 
ketopantoate reductase. 

65. The recombinant microorganism of any one of claims 62 to 64, wherein 
the pantothenate biosynthetic enzyme is ketopantoate reductase. 

66. A recombinant microorganism which overexpresses aspartate-a- 
35 decarboxylase and has a deregulated isoleucine- valine </7v) biosynthetic pathway. 
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67. A recombinant microorganism having a mutant coaX gene, said mutant 
cooTgene encoding reduced pantothenate kinase activity in said microorganism. 

68. The recombinant microorganism of t:laim 67 further having a mutant 

5 coaA gene, said mutant coaA gene encoding reduced pantothenate kinase activity in said 
microorganism. 

69. A recombinant microorganism having a mutant coaA'gene and optionally 
having a mutant coaA gene, said mutant microorganism having reduced pantothenate 

10 kinase activity as compared to a microorganism having vtild-type^cooA and cooATgenes. 

70. A recombinant microorganism comprising a vector comprising an 
isolated coaA" gene. 1 

15 71. A recombinant microorganism that overproduces a panto-compound, the 

microorganism having a deregulated pantothenate biosynthetic pathway and having at 
least one mutation that results in a decrease in the capacity of the microorganism to 
synthesize Coenzyme A '(CoA). 

20 72. The recombinant microorganism of claim 7 1 , having at least one 

mutation that results in a reduced level of pantothenate kinase activity. 

73. The recombinant microorganism of claim 72, having a mutation in a 
coaA gene, or homologue thereof, that results in a reduced level of CoaA enzyme 

25 activity. 

74. The recombinant microorganism of claim 72, having a mutation in a 
coaX gene, or homologue thereof, that results in a reduced level of CoaX enzyme 
activity. 

30 

75. The recombinant microorganism of -claim 72, having a mutation in a 
coaA gene, or homologue thereof, and having a mutation in a coaX gene, or homologue 
thereof, the mutations resulting in reduced levels of CoaA -enzyme activity and reduced 
CoaX enzyme activity. 

35 

76. The recombinant microorganism of any one of claims 66 to 70 which 
further has a deregulated pantothenate biosynthetic pathway. 
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77. The recombinant microorganism of any one of claims 62 to 65 and 67 to 
75, further having a deregulated isoleucine-valine (z/v) biosynthetic pathway. 

5 78. The recombinant microorganism of any one of claims 62 to 77, which is 

Gram positive. 

79. The recombinant microorganism of claim 78 belonging to a genus 
selected from the group consisting of Bacillus, Cornyebacterium, Lactobacillus, 

1 0 Lactococci and Streptomyces. 

80. The recombinant microorganism of claim 79 belonging to the genus 

i 

Bacillus. 

15 81. The recombinant microorganism of claim 80 which is Bacillus subtilis. 

82. A recombinant microorganism selected from the group consisting of 
PA221, PA235, PA236, PA313, PA410, PA402, PA403, PA41 1, PA412, PA413, 
PA303, PA327, PA328, PA401, PA340, PA342, PA404, PA405, PA374, PA354, 

20 P A365, PA377, PA65 1 and PA824. 

83. A recombinant vector for use in the production of panto-compounds 
comprising a nucleic acid sequence which encodes at least one Bacillus pantothenate 
biosynthetic enzyme operably linked to regulatory sequences. 

25 

84. The vector of claim 83, comprising a nucleic acid sequence which 
encodes at least one Bacillus subtilis pantothenate biosynthetic enzyme. 

85. The vector of claim 84, wherein the nucleic acid sequence encodes at 
30 least one of ketopantoate hydroxymethyltransferase, pantothenate synthetase, aspartate- 

a-decarboxylase and ketopantoate reductase. 

86. A recombinant vector comprising at least one nucleic acid sequence 
selected from the group consisting of SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 

35 SEQ ID NO:29 and SEQ ID NO:59. 
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87. The vector of claim 84, wherein the nucleic acid sequence encodes 
ketopantoate reductase. 

88. A vector comprising a mutant coaVgene, said mutant encoding a 
5 pantothenate kinase enzyme having reduced activity. 

89. A vector comprising an isolated coaXgene. 

90. A vector comprising an isolated Bacillus coaX gene. 

91. A vector comprising an isolated Bacillus subtilis coaX gene. 

92. The vector of any one of claims «6 and 89 to 9 1 , which furthercomprises 
regulatory sequences. 

93. The vector of any one of claims 83 to 85, 87 and 92, wherein the 
regulatory sequences comprise a cohstitutively active promoter. 
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94. The vector of claim 93, wherein the constitutively active promoter 
20 comprises ? wg (SEQ ID NO:41), />,j (SEQ ID NO:39) or P 26 <SEQ ID NO:40) 

sequences. 

95 . The vector of claim 83 , wherein the regulatory sequences-comprise at 
least one artificial ribosome binding site<RBS). 
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96. The vector of claim 95, wherein the artificial RBS comprises a nucleotide 
sequence selected from the group consisting of SEQ ID NO:49, SEQ IDNO:50, SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID 
NO:56 and SEQ ID NO:57. 

97. A vector selected from the group consisting of pANO04,<pAN005, 
pAN006, pAN236, pAN423, pAN428, pAN429, pAN44l, pAN442, pAN443, pAN251 , 
pAN267, pAN256, pAN257, pAN263, pAN240, pAN294, pAN296, pAN336, pAN341 
and pAN342. 

98. A recombinant microorganism -comprising the vector of claim 156 or 93. 



oucrw»irv 



WO 01/21772 



PCT/US00/25993 



-111 - 



99. An isolated nucleic acid molecule which encodes at least one Bacillus 
pantothenate biosynthetic gene. 

100. The isolated nucleic acid molecule of claim 99 which encodes at least 
5 one Bacillus subtilis pantothenate biosynthetic gene. 

101. The isolated nucleic acid molecule of claim 99 or 1 00 which encodes 
ketopantoate reductase. 

10 1 02. An isolated Bacillus pantothenate biosynthetic enzyme polypeptide. 



1 03. An isolated Bacillus subtilis pantothenate biosynthetic enzyme 



• 



polypeptide. 
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1 04. An isolated Bacillus ketopantoate reductase polypeptide. 



105. An isolated Bacillus subtilis ketopantoate reductase polypeptide. 



106. An isolated Bacillus aspartate-a-decarboxylase polypeptide. 



20 



107. An isolated Bacillus subtilis aspartate-a-decarboxylase polypeptide. 



108. An isolated nucleic acid molecule comprising a mutant coaX gene. 



25 



109. An isolated nucleic acid molecule comprising a coaA'gene. 



110. An isolated pantothenate kinase protein encoded by a coaX gene. 
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Figure 2. Plasmid pAN240, containing sequences ligated upstream of 
the P 26 panBCD cassette, equivalent to the integrated version in strain 
PA221. 
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Figure Plasmid pAN236, tontaining an integratable and amplifiable P26-RBS2-panEl 
expression cassette. 
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FtgtireS Construction ofpA\423 
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Figured Construction of pAN426 and pAN427. 
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Fi$ure 7 Construction o/pAS42S and pAS'429. 
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Figun* 8. Construction of p. \S4M. 
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Figure 9. Construction ofpA\44L 
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Figure 10. Construction ofpAS440. 
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Figure \ [ Structure ofpAN251, a plasmid designed to integrate a single copy of P 26 panEl at the 
pan El locus by double crossover. 
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Figure /*{ Structure o/pANZ63, designed to stably integrate a single copy of 
P26 ilvD at the ilvD locus. 



WO 01/21772 



PCTAJS00/25993 



17/31 



17/31 



pantothenate 



ATP 



AOP <r 

Y 

4'phosphopantothenate 



pantothenate kinase 

{ coaA, panK, «coaX) 



CTP, cysteine 



4'-phosphopantothenytaysteine 



4'-phosphopantetheine 
ATP — * 



. coaD 

Dephosphocoenzyme A 
ATP 

ADP <T 

> 

"Coenzyme A 




Holo-ACP 



WO 01/21772 



18/31 



PCT/US00/25993 




RNonrr-.in- ,wn (\\owr>v> 1 > 



WO 01/21772 



19/31 



PCT/US00/25993 




WO 01/21772 



20/31 



PCT/US00/25993 



Figure / <f Structure ofpAN281, a plasmidfor expressing B. subtilis coaA after integration 
at the bpr locus. pAN282 andpAN283 have similar structures. 
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Figure 2(o Structure ofpAN336 9 a plasmid designed to delete B. subtilis coaX from the 
chromosome and replace it with a kanamycin resistance gene. 
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SEQUENCE LISTING 

<110> OMNIGENE BIOPRODUCTS 

<120> METHODS AND MICROORGANISMS FOR PRODUCTION OF 
PAN TO -COM POUNDS . 

<130> BGI-141CPPC 

<140> 
<141> 

<150> USSN 09/400,494 
<151> 1999-09-21 

<150> USSN 60/210,072 
<151> 2000-06-07 

<150> USSN 60/221,836 ( 
<151> 2000-07-28 

<150> USSN 60/221,836 
<151> 2000-08-24 

<160> 94 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 311 
<212> PRT 

<213> Haemophilus influenzae 
<400> 1 

Met Glu Phe Ser Thr Gin 
1 5 

Gin Trp Ala Glu Leu Arg 
20 

Asp Leu Lys Pro Leu Leu 
35 

Val Ser Thr lie Tyr Leu 
50 

Asp Glu Asn Leu His Arg 
65 70 

Asn Asn Ala Lys Thr Pro 
85 

Val Gly Lys Ser Thr Ser 
100 

Trp Pro Thr Glu Arg Lys 
115 



Gin Thr Pro Phe Leu Ser Phe Asn Arg Glu 
10 15 

Lys Ser Val Pro Leu Lys Leu Thr Glu Gin 
25 30 

Gly Phe Asn Glu Asp Leu Ser Leu Asp Glu 
40 45 

Pro Leu Thr Arg Leu lie Asn Tyr Tyr lie 
55 60 

Gin Thr Val Leu His Arg Phe Leu Gly Arg 
75 80 

Tyr He lie Ser He Ala Gly Ser Val Ala 
90 95 



Ala Arg He Leu Gin Ser Leu Leu Ser His 
105 110 

Val Asp Leu He Thr Thr Asp Gly Phe Leu 
120 125 



WO 01/21772 PCTAJS00/25993 

-2- 

Tyr Pro Leu Asn Lys Leu Lys Gin Asp Asn Leu Leu Gin Lys Lys <51y 
130 135 140 

Phe Pro Val Ser Tyr Asp Thr Pro Lys Leu Il« Arg Phe Leu Ala Asp 
145 150 155 160 

Val Lys Ser Gly Lys Ser Asn Val Thr Ala Pro He Tyr Ser His Leu 
165 170 175 

Thr Tyr Asp He He Pro Asp Lys Phe Asp Val Val Asp Lys Pro Asp 
180 185 190 

He Leu He Leu Glu Gly Leu Asn Val Leu Cln Thr <Uy Asn Asn Lys 
195 200 205 

Thr Asp Gin Thr Phe Val Ser Asp Phe Val Asp Phe Ser He Tyr Val 
210 215 220 

Asp Ala Glu Glu Lys Leu Leu Lys Glu Trp Tyr He Lys Arg Phe Leu 
225 230 235 240 

Lys Phe Arg Glu Ser Ala Phe Asn Asp Pro Asn Ser Tyr Phe Lys His 
24 5 250 255 

Tyr Ala Ser Leu Ser Lys -Glu -Glu Ala He Ala Thr Ala Ser Lys lie 
260 • 2S5 270 

Trp Asp Glu He Asn Gly Leu Asn Leu Asn "Gin Asn He Leu Pro Thr 
275 280 285 

Arg Glu Arg Ala Asn Leu He Leu Lys Lys Gly His Asn His <51n Val 
290 295 300 

Glu Leu He Lys Leu Arg Lys 
305 310 



<210> 2 
<211> 316 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Met Ser He Lys <31u Gin Thr Leu *1et Thr Pro Tyr Leu -Gin Phe Asp 
I 5 10 15 

Arg Asn Gin Trp Ala Ala Leu Arg Asp Ser Val Pro Met Thr Leu Ser 
20 . 25 30 

Glu Asp Glu He Ala Arg i*eu Lys -Gly lie Asn <31u Asp Leu Ser Leu 
35 40 45 

Glu Glu Val Ala Glu lie Tyr Leu Pro Leu Ser Arg Leu Leu Asn Phe 
50 *55 60 

Tyr He Ser Ser Asn Leu Arg Arg Gin Ala Val Leu "Glu <Gl*i Phe Leu 
65 70 75 80 
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Gly Thr Asn Gly <Sln Arg lie Pro Tyr lie lie Ser lie Ala Gly Ser 
85 90 95 

Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gin Ala Leu Leu 



Ser Arg Trp Pro Glu His Arg Arg Val Glu Leu lie Thr Thr Asp Gly 
115 120 125 

i 

Phe Leu His Pro Asn Gin Val Leu Lys Glu Arg Gly Leu Met Lys Lys 
130 135 140 

Lys -Gly Phe Pro Glu Ser Tyr Asp Met His Arg Leu Val Lys Phe Val 
145 150 155 160 

Ser Asp Leu Lys Ser Gly Val Pro Asn Val Thr Ala Pro Val Tyr Ser 
165 170 175 

His Leu lie Tyr Asp Val lie Pro Asp Gly Asp Lys Thr Val Val Gin 
180 185 190 

Pro Asp lie Leu lie Leu Glu Gly Leu Asn Val Leu' Gin Ser Gly Met 
195 200 205 

Asp Tyr Pro His Asp Pro His His Val Phe Val Ser Asp Phe Val Asp 
210 215 220 

Phe Ser lie Tyr Val Asp Ala Pro Glu Asp Leu Leu Gin Thr Trp Tyr 
225 230 235 240 

lie Asn Arg Phe Leu Lys Phe Arg Glu Gly Ala Phe Thr Asp Pro Asp 
245 250 255 

Ser Tyr Phe His Asn Tyr Ala Lys Leu Thr Lys "Glu Glu Ala lie Lys 
260 265 270 

Thr Ala Met Thr Leu Trp Lys Glu lie Asn Trp Leu Asn Leu Lys Gin 
275 280 285 

Asn lie Leu Pro Thr Arg <31u Arg Ala Ser Leu lie Leu Thr Lys Ser 
290 295 300 

Ala Asn His Ala Val Glu Glu Val Arg Leu Arg Lys 
305 310 315 



<210> 3 
<211> 319 
<212> PRT 

<213> Bacillus subtilis 
<400> 3 

Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gin His Asn 
1 5 10 15 

Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser lie Ala Val Ser 



100 



105 



110 



20 



25 



30 
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Glu Glu Glu Ala Lys Ala Val Glu <51y Leu Asn Asp Tyr Leu Ser Val 
35 40 45 

Glu "Glu Val <31u Thr He Tyr He Pro Leu Val Arg Leu Leu His Leu 
50 55 60 

His Val Lys Ser Ala Ala -Glu Arg Asn Lys His Val Asn Val Phe Xeu 
€5 70 75 80 

Lys His Pro His Ser Ala Lys lie Pro Phe He lie Gly He Ala -Gly 
85 90 95 

Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg lie Leu Gin Lys Leu 
100 105 HO 

Leu Ser Arg Leu Pro Asp Arg Pro Lys Val Ser Leu He Thr Thr Asp 
115 120 125 

Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys Lys Lys Asn Met Met Ser 
130 135 140 

Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu Leu -Glu Phe 
145 150 155 160 

Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr 
165 . . 170. 175 

Ser His Leu Thr Tyr Asp Arg Glu -Glu Gly Val Phe >Glu Val Val Glu 
180 185 r90 

Gin Ala Asp He Val He He Glu <51y He Asn Val Leu Gin Ser Pro 
195 200 205 

Thr Leu Glu Asp Asp Arg <31u Asn Pro Arg He Phe Val Ser Asp -Phe 
210 215 220 

Phe Asp Phe Ser He Tyr Val Asp Ala -Glu Glu Ser -Arg He Phe Thr 
225 230 235 240 

Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe -Gin Asn 
245 250 255 

Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp -Gin ^Glu Ala 
260 265 270 

Asp Glu Met Ala Ala Ser He Trp Glu Ser Val Asn Arg Pro Asn Leu 
275 280 285 

Tyr Glu Asn He Leu Pro Thr Lys Phe Arg Ser Asp Leu He Leu Arg 
290 295 300 

Lys Gly Asp Gly His Lys Val -Glu <Glu Val Leu Val Arg Arg Val 
305 310 315 



<210> 4 
<211> 312 
<212> PRT 
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<213> Mycobacterium leprae 



<400> 4 

Met Pro Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp Arg Lys 
15 10 15 

i 

Gin Trp Arg Ala Leu Arg Met Ser Thr Pro Leu Ala Leu Thr Glu -Glu 
20 25 30 

Glu Leu He Gly Leu Arg Gly Leu Gly Glu -Gin He Asp Leu Leu Glu 
35 40 45 

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu He His Leu Gin Val 
50 55 60 

Ala Ala Arg Gin Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu 
65 70 75 80 

Pro Gin Gin Asn Pro Gly Arg Pro Val Pro Phe He He Gly Val Ala 
85 90 95 

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg' Val Leu Gin Ala 
100 105 110 

Leu Leu Ala Arg Trp Asp His His Thr Arg Val Asp Leu Val Thr Thr 
115 120 * 125 

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gly Arg Arg Asn Leu Met 
130 135 140 

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg 
145 150 155 160 

Phe Val Thr Ser Val Lys Ser Gly Ala Asp Tyr Ala Cys Ala Pro Val 
165 170 175 

Tyr Ser His Leu Arg Tyr Asp Thr He Pro Gly Ala Lys His Val Val 
180 185 ~ 190 

Arg His Pro Asp He Leu He Leu Glu Gly Leu Asn Val Leu Gin Thr 
195 200 205 

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val 
210 215 220 

Asp Ala Arg He Gin Asp He Glu Gin Trp Tyr Val Ser Arg Phe Leu 
225 230 235 240 

Ala Met Arg Gly Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His 
245 250 255 

Tyr Ser Ala Leu Thr Asp Ser Lys Ala lie He Ala Ala Arg Glu lie 
260 265 270 

Trp Arg Ser He Asn Arg Pro Asn Leu Val Glu Asn He Leu Pro Thr 
275 280 285 

Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser He 
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290 



295 



300 



Asn Arg Leu Arg Leu Arg Lys Leu 
305 ' 310 



<210> 5 
<211> 312 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 5 

Met Ser Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp Arg Arg 
15 10 15 

Gin Trp Arg Ala Leu Arg Met Ser Thr Pro Leu Ala Leu Thr Glu Glu 
20 25 30 

Glu Leu Val Gly Leu Arg Gly Leu Gly Glu Gin lie Asp Leu Leu Glu 
35 40 45 

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu lie His Leu Gin Val 
50 55 60 

Ala Ala Arg Gin Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu 
65 70. 75 80 

Pro Gin Gin Asn Pro Asp Arg Pro Val Pro Phe Tie lie Gly Val Ala 
85 90 -95 

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val -Leu Gin Ala 
100 105 110 

Leu Leu Ala Arg Trp Asp His His Pro Arg Val Asp Leu Val Thr Thr 
115 120 125 

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gin Arg Arg Asn Leu -Met 
130 135 , 140 

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg 
145 150 155 160 

Phe Val Thr Ser Val Lys Ser *Gly Ser Asp Tyr Ala Gys Ala Pro Val 
165 170 175 

Tyr Ser His Leu His Tyr Asp lie lie Pro Gly Ala Glu *Gln Val Val 
180 185 190 

Arg His Pro Asp He Leu lie Leu Glu Gly Leu Asn Val Leu -Gin Thr 
195 2O0 205 

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val 
210 215 220 

Asp Ala Arg He Glu Asp He Glu -Gin Trp Tyr Val -Ser Arg Phe Leu 
225 230 235 240 

Ala Met Arg Thr Thr Ala Phe Ala Asp Pro Glu Ser Uis Phe His His 
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245 



250 



255 



Tyr 



Ala Ala 



Phe Ser Asp 
260 



Ser Gin Ala Val Val Ala Ala Arg Glu lie 
265 270 



Trp Arg Thr He Asn Arg Pro Asn Leu Val Glu Asn lie Leu Pro Thr 
275 280 285 

Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser He 
290 " 295 300 

Asn Arg Leu Arg Leu Arg Lys Leu 



<210> 6 
<211> 329 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 6 

Met He Ser Pro Val Pro Ser He Pro Arg Ser Ala 'His Arg Gin Arg 
15 10 15 

Pro Glu Ala Thr Pro Tyr Val Asp Leu Thr Arg Pro Glu Trp Ser Ala 
20 25 30 

Leu Arg Asp Lys Thr Pro Leu Pro Leu Thr Ala Glu Cilu Val Glu Lys 
35 40 45 

Leu Arg Gly Leu Gly Asp Val He Asp Leu Asp Glu Val Arg Asp lie 
50 55 60 

Tyr Leu Pro Leu Ser Arg Leu Leu Asn Leu Tyr Val Gly Ala Thr Asp 
65 70 75 80 

Gly Leu Arg Gly Ala Leu Asn Thr Phe Leu Gly Glu Gin Gly Ser Gin 
85 90 95 

Ser Gly Thr Pro Phe Val lie Gly Val Ala Gly Ser Val Ala Val Gly 
100 105 110 

Lys Ser Thr Val Ala Arg Leu Leu Gin Ala Leu Leu Ser Arg Trp Pro 
115 120 125 

Glu His Pro Arg Val Glu Leu Val Thr Thr Asp Gly Phe Leu Leu Pro 
130 135 140 

Thr Arg Glu Leu Glu Ala Arg Gly Leu Met Ser Arg Lys Gly Phe Pro 
145 150 155 160 

Glu Ser Tyr Asp Arg Arg Ala Leu Thr Arg Phe Val Ala Asp lie Lys 
165 170 175 

Ala Gly Lys Ala Glu Val Thr Ala Pro Val Tyr Ser His Leu lie Tyr 
180 185 190 

Asp He Val Pro Asp Gin Arg Leu Val Val Arg Arg Pro Asp lie Leu 



305 



310 
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195 200 205 

He Val Glu Gly Leu Asn Val Leu GLn Pro Ala Leu Pro <51y Lys Asp 
210 ' 215 220 

Gly Arg Thr Arg Val Gly Leu Ala Asp Tyr Phe Asp Phe Ser Val Tyr 
225 " 230 235 240 

Val Asp Ala Arg Thr Glu Asp He <Glu Ar<g Trp Tyr Leu Asn Arg Phe 
245 250 255 

Arg Lys Leu Arg Ala Thr Ala Phe Gin Asn Pro Ser Ser Tyr Phe Arg 
260 2*5 270 

Lys Tyr Thr Gin Val Ser Glu Glu Glu Ala Leu Asp Tyr Ala Arg Thr 
275 280 285 

Thr Trp Arg Thr He Asn Lys Pro Asn Leu Val Glu A9n Val Ala Pro 
290 295 300 

Thr Arg Gly Arg Ala Thr Leu Val Leu Arg Lys Gly Pro Asp His Lys 
305 310 315 320 

Val Gin Arg Leu Ser Leu Arg Lys l>eu 
325 



<210> 7 
<211> 265 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 7 

Met Leu Leu Thr He Asp Val Gly Asn Thr His Thr Val Leu *Gly Leu 
1 5 10 15 

Phe Asp Gly Glu Asp He Val Glu His Trp Arg He Ser Thr Asp Ser 
20 ,25 30 

Arg Arg Thr Ala Asp Glu Leu Ala Val Leu Leu Gin Gly Leu <Met <31y 
35 40 45 

Met His Pro Leu Leu Gly Asp Glu Leu Gly Asp Gly lie Asp Gly He 
50 55 -60 

Ala He Cys Ala Thr Val Pro Ser Val Leu His Glu Leu Arg Glu Val 
65 ' 70 75 80 

Thr Arg Arg Tyr Tyr Gly Asp Val Pro Ala Val Leu Val -Glu Pro <51y 
85 90 95 

Val Lys Thr Gly Val Pro He Leu Thr Asp His Pro i-ys -Glu Val <Sly 
100 105 HO 

Ala Asp Arg He He Asn Ala Val Ala Ala Val Glu Leu Tyr Gly Gly 
115 120 125 

Pro Ala He Val Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Aia Val 
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130 



135 



140 



Ser Ala Arg Gly Glu Tyr lie Gly Gly Val He Ala Pro Gly He Glu 
145 150 * 155 160 

He Ser Val Glu Ala Leu Gly Val Lys Gly Ala Gin Leu Arg Lys He 
165 170 175 

Glu Val Ala Arg Pro Arg Ser Val He Gly Lys Asn Thr Val Glu Ala 
180 185 190 

Met Gin Ser Gly He Val Tyr Gly Phe Ala Gly Gin Val Asp Gly Val 
195 200 205 

Val Asn Arg Met Ala Arg Glu Leu Ala Asp Asp Pro Asp Asp Val Thr 
210 215 220 

Val He Ala Thr Gly Gly Leu Ala Pro Met Val Leu Gly Glu Ser Ser 
225 230 235 240 

Val He Asp Glu His Glu Pro Trp Leu Thr Leu Met Gly Leu Arg Leu 
245 250 1 255 



Val Tyr Glu Arg Asn Val Ser Arg Met 
260 265 



<210> 8 
<211> 272 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 8 

Met Leu Leu Ala He Asp Val Arg Asn Thr His Thr Val Val Gly Leu 
1 5 10 15 

Leu Ser Gly Met Lys Glu His Ala Lys Val Val Gin Gin Trp Arg He 
20 25 30 

Arg Thr Glu Ser Glu Val Thr Ala Asp Glu Leu Ala Leu Thr He Asp 
35 40 45 

Gly Leu He Gly Glu Asp Ser Glu Arg Leu Thr Gly Thr Ala Ala Leu 
50 55 60 

Ser Thr Val Pro Ser Val Leu His -Glu Val Arg He Met Leu Asp Gin 
65 70 75 80 

Tyr Trp Pro Ser Val Pro His Val Leu He Glu Pro Gly Val Arg Thr 
85 90 95 

Gly He Pro Leu Leu Val Asp Asn Pro Lys Glu Val Gly Ala Asp Arg 
100 105 110 

He Val Asn Cys Leu Ala Ala Tyr Asp Arg Phe Arg Lys Ala Ala He 
115 120 125 



Val Val Asp Phe Gly Ser Ser He Cys Val Asp Val Val Ser Ala Lys 
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130 



135 



140 



Gly Glu Phe Leu Gly Gly Ala He Ala <Pro Gly Val <51n Val Ser Ser 
14 5 150 155 160 

Asp Ala Ala Ala Ala Arg Ser Ala Ala Leu Arg Arg Val -Glu I>eu Ala 
165 170 175 

Arg Pro Arg Ser Val Val Gly Lys Asn Thr Val Glu Cys Met Gin Ala 
180 . 185 190 

Gly Ala Val Phe Gly Phe Ala Gly Leu Val Asp Gly Leu Val -Gly Arg 
195 " 200 205 

He Arg Glu Asp, Val Ser Gly Phe Ser Val Asp His Asp Val Ala He 
210 2 IS 220 

Val Ala Thr Gly His Thr Ala Pro Leu Leu Leu Pro Glu Leu Hi-s Thr 
225 230 235 240 

Val Asp His Tyr Asp Gin His Leu Thr Leu Gin Gly Leu Arg Leu Val 
24 5 250 255 



Phe Glu Arg Asn Leu Glu Val Gin Arg Gly Arg Leu Lys Thr Ala Arg 
260 2-65 270 



<210> 9 
<211> 258 
<212> PRT 

<213> Bacillus subtilis 
<400> 9 

Leu Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

Tyr His Asp Gly Lys Leu <Glu Tyr His Trp Arg lie Glu Thr "Ser Arg 
20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met lie Leu Arg Ser Leu Phe Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin He Asp Gly He He He Ser Ser 
50 55 ^60 

Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
€5 70 75 80 

Phe His lie Glu Pro Gin He Val Gly Pro Gly Met Lys Thr -Gly Leu 
85 90 95 

Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie Val 
100 105 HO 

Asn Ala Val Ala Ala lie His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 
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Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr lie Asp Glu Asn Lys Gin 
130 135 140 

Tyr Met Gly Gly Ala lie Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 
165 170 , 175 

Asp Asn lie He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 

Trp Gin Ala Lys Gin Asp Leu Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 220 

Pro Leu He Ala Asn Glu Ser Asp Cys He Asp He Val Asp Pro Phe 
225 230 235 240 

i 

Leu Thr Leu Lys Gly Leu Glu Leu He Tyr Glu Arg Asn Arg Val Gly 
245 250 255 

Ser Val 



<210> 10 
<211> 262 
<212> PRT 

<213> Deinococcus radiopugnans 
<400> 10 

Met Pro Ala Phe Pro Leu Leu Ala Val Asp He Gly Asn Thr Thr Thr 
15 10 15 

Val Leu -Gly Leu Ala Asp Ala Ser Gly Ala Leu Thr His Thr Trp Arg 
20 25 30 

He Arg Thr Asn Arg Glu Met Leu Pro Asp Asp Leu Ala Leu -Gin Leu 
35 40 ' 45 

His Gly Leu Phe Thr Leu Ala Gly Ala Pro He Pro Arg Ala Ala Val 
50 55 60 

Leu Ser Ser Val Ala Pro Pro Val Gly Glu Asn Tyr Ala Leu Ala Leu 
65 70 75 80 

Lys Arg His Phe Met He Asp Ala Phe Ala Val Ser Ala Glu Asn Leu 
85 90 95 

Pro Asp Val Thr Val Glu Leu Asp Thr Pro Gly Ser Val Gly Ala Asp 
100 105 110 



Arg Leu Cys Asn Leu Phe Gly Ala Glu Lys Tyr Leu Gly Gly Leu Asp 
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115 120 • 125 

Tyr Ala Val Val Val Asp Phe <51y Thr Ser Thr Asn Phe Asp Val Val 
130 135 140 

i 

Gly Arg Gly Arg Arg Phe Leu -Gly Gly lie Leu Ala Thr Gly Ala Gin 
145 150 155 160 

Val Ser Ala Asp Ala Leu Phe Ala Arg Ala Ala Lys Leu Pro Arg lie 
L65 170 175 

Thr Leu Gin Ala Pro Glu Thr Ala He *Gly Lys Asn Thr Val His Ala 
180 185 190 

Leu Gin Ser Gly Leu Val Phe -Gly Tyr Ala -Glu Met Val Asp -Gly Leu 

195 200 205 

* 

Leu Arg Arg He Arg Ala Glu Leu Pro <Uy Glu Ala Val Ala Val Ala 
210- 215 220 

» 

Thr Gly Gly Phe Ser Arg Thr Val Gin Gly He «Cys -Gin Glu He Asp 
' 225 " " 230 235 240 

Tyr Tyr Asp Glu Thr Leu Thr Leu Arg Gly Leu Val -Glu Leu Trp Ala 
24 5 250 255 

Ser Arg Ser Glu Val Arg 
2*0 



<210> 11 
<211> 212 
<212> PRT 

<213> Desulfovibrio vulgaris 
<400> 11 

Met Thr Gin His Phe Leu Leu Phe Asp lie Gly Asn Thr Asn Val Xys 
15 10 15 

He Gly He Ala Val Glu Thr Ala Val Leu Thr Ser Tyr Val Leu Pro 
20 25 30 

Thr Asp Pro Gly Gin Thr Thr Asp Ser lie *Gly -Leu Arg Leu Leu Glu 
35 40 45 

Val Leu Arg His Ala Gly Leu Gly Pro Ala Asp Val <31y Ala Cys Val 
50 55 60 

Ala Ser Ser Val Val Pro Gly Val Asn Pro Leu lie Arg Arg Ala Cys 
65 70 75 80 

Glu Arg Tyr Leu Tyr Arg Lys Leu Leu Phe Ala Pro <31y Asp lie Ala 
85 90 95 

lie Pro Leu Asp Asn Arg Tyr Glu Arg Pro Ala Glu Val ^ly Ala Asp 
100 105 HO 

Arg Leu Val Ala Ala Tyr Ala Ala Arg Arg Xeu Tyr Pro <31y Pro Arg 
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115 120 125 

Ser Leu Val Ser Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Cys Val 
130 135 140 

Glu Gly Gly Ala Tyr Leu Gly Gly Leu He Cys Pro -Gly Val Leu Ser 
145 150 155 160 

Ser Ala Gly Ala Leu Ser Ser Arg Thr Ala Lys Leu Pro Arg He Ser 
165 170 175 

Leu Glu Val Glu Glu Asp Ser Pro Val He Gly Arg Ser Thr Thr Thr 
180 185 190 

Ser Leu Asn His Gly Phe He Phe Gly Phe Ala Ala Met Thr Glu Gly 
195 200 205 

Val Leu Ala Ala 
210 



<210> 12 
<211> 246 
<212> PRT 

<213> Thermotoga maritima 
<400> 12 

Met Tyr Leu Leu Val 
1 5 

Thr Glu Asp Gly Lys 
20 

Phe Gin Thr Glu Asp 
35 

Asp Ala Met Arg Glu 
50 

Thr Gin Asn Thr Val 
65 

Ser Pro He Trp Val 
85 

Lys Asn Pro Ser Glu 
100 

Phe Val Lys Glu Tyr 
115 

Ala Thr Thr Val Asp 
130 

lie Leu Pro Gly Phe 
145 



Asp Val Gly Asn Thr His Ser Val Phe Ser He 
10 15 

Thr Phe Arg Arg Trp Arg Leu Ser Thr Gly Val 
25 30 

Glu Leu Phe Ser His Leu His Pro Leu Leu Gly 
40 45 

He Lys Gly He Gly Val Ala Ser Val Val Pro 
55 60 

He Glu Arg Phe Ser Gin Lys Tyr Phe His He 
70 75 80 

Lys Ala Lys Asn Gly Cys Val Lys Trp Asn Val 
90 95 

Val Gly Ala Asp Arg Val Ala Asn Val Val Ala 
105 110 

Gly Lys Asn Gly He He He Asp Met Gly Thr 
120 125 

Leu Val Val Asn Gly Ser Tyr Glu Gly Gly Ala 
135 140 



Phe Met Met Val His Ser Leu Phe Arg Gly Thr 
150 155 160 



Ala Lys Leu Pro Leu Val Glu Val Lys Pro Ala Asp Phe Val Val Gly 
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165 170 175 

Lys Asp Thr -Glu €lu Asn He Arg Leu <31y Val Val Asn -Gly Ser Val 
180 185 190 

Tyr Ala Leu Glu Gly lie He <31y Arg lie Lys Glu Val Tyr *Gly Asp 
195 200 205 

Leu Pro Val Val Leu Thr Gly Gly Gin 'Ser Lys lie Val Lys Asp Met 
210 215 220 

He Lys His Glu He Phe Asp Glu Asp Leu Thr lie Lys XUy Val Tyr 
225 230 ^35 240 

His Phe Cys Phe Gly Asp 
245 



<210> 13 

<211> 273 i 
<212> PRT 

<213> Treponema pallidum 
<400> 13 

Met Leu Leu He Asp Val Gly Asn Ser His Val Val Phe Gly lie Gin 

1 5 . , , 10 , 15 

Gly Glu Asn Gly Gly Arg Val Cys Val Arg Glu Leu Phe Arg i,eu Ala 
20 25 30 

Pro Asp Ala Arg Lys Thr Gin Asp Glu Tyr Ser Leu Leu He His Ala 
35 40 45 

Leu Cys Glu Arg Ala Gly Val Gly Arg Ala Ser Leu Arg Asp Ala Phe 
50 55 60 

He Ser Ser Val Val Pro Val Leu Thr Lys Thr He Ala Asp Ala Val 
65 70 75 80 

Ala Gin lie Ser Gly Val Gin Pro Val Val Phe <Gly Pro T*p Ala Tyr 
85 90 ^5 

Glu His Leu Pro Val Arg lie Pro «Glu Pro Val Arg Ala Glu lie Gly 
100 105 HO 

Thr Asp Leu Val Ala Asn Ala Val Ala Ala Tyr Val His Phe Arg Ser 
115 120 125 

Ala Cys Val Val Val Asp Cys Gly Thr Ala Leu Thr Phe Thr Ala Val 
130 135 140 

Asp Gly Thr Gly Leu He Gin Gly Val Ala He Ala Pro Gly Leu Arg 
145 ' 150 155 160 

Thr Ala Val Gin Ser Leu His Thr Gly Thr Ala Gin Leu Pro Leu Val 
165 170 175 

Pro Leu Ala Leu Pro Asp Ser Val Leu Gly Lys Asp Thr Thr His Ala 
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180 



185 



190 



Val Gin Ala Gly Val Val Arg Gly Thr Leu Phe Val lie Arg; Ala Wet 
195 200 205 

He Ala Gin Cys Gin Lys Glu Leu Gly Cys Arg Cys Ala Ala Val He 
210 215 220 

Thr Gly Gly Leu Ser Arg Leu Phe Ser Ser Glu Val Asp Phe Pro Pro 
225 230 235 240 

lie Asp Ala Gin Leu Thr Leu Ser Gly Leu Ala His He Ala Arg Leu 
245 250 255 

Val Pro Thr Ser Leu Leu Pro Pro Ala Thr Val Ser Gly Ser Ser Gly 
260 265 270 



Asn 



<210> 14 
<211> 262 
<212> PRT 

<213> Borrelia burgdorferi 
<400> 14 

Met Asn Lys Pro Leu Leu Ser Glu Leu lie He Asp He Gly Asn Thr 
1 5 10 ~ 15 

Ser He Ala Phe Ala Leu Phe Lys Asp Asn Gin Val Asn Leu Phe He 
20 25 30 

Lys Met Lys Thr Asn Leu Met Leu Arg Tyr Asp Glu Val Tyr Ser Phe 
35 4 0 4 5 

Phe Glu Glu Asn Phe Asp Phe Asn Val Asn Lys Val Phe He Ser Ser 
50 55 60 

Val Val Pro He Leu Asn Glu Thr Phe Lys Asn Val He Phe Ser Phe 
65 70 75 80 

Phe Lys He Lys Pro Leu Phe He Gly Phe Asp Leu Asn Tyr Asp Leu 
85 90 ' 95 

Thr Phe Asn Pro Tyr Lys Ser Asp Lys Phe Leu Leu Gly Ser Asp Val 
100 105 " 110 

Phe Ala Asn Leu Val Ala Ala He Glu Asn Tyr Ser Phe Glu Asn Val 
115 120 125 

Leu Val Val Asp Leu Gly Thr Ala Cys Thr He Phe Ala Val Ser Arg 
130 135 140 

Gin Asp Gly He Leu Gly Gly lie He Asn Ser Gly Pro Leu He Asn 
145 150 155 160 



Phe Asn Ser Leu Leu Asp Asn Ala Tyr Leu lie Lys Lys Phe Pro He 
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165 170 . 175 

Ser Thr Pro Asn Asn Leu Leu Glu Ar<g Thr Thr *Ser t31y Ser Val Asn 
180 185 190 

Ser Gly Leu Phe Tyr £ln Tyr Lys Tyr Leu lie <31u *Gly Val Tyr Arg 
195 200 205 

Asp He Lys tUn Met Tyr Lys Lys Lys Phe Asn Leu lie lie Thr .Gly 
210 215 220 

Gly Asn Ala Asp Leu He Leu Ser Leu He Glu lie Glu Phe He Phe 
225 230 235 240 

Asn He His Leu Thr Val Glu Gly Val Arg He Leu «Gly Asn Ser He 
245 250 255 

Asp Phe Lys Phe Val Asn 
260 



<210> 15 
<211> 229 
<212> PRT 

<213> Aquifex aeolicus 
<400> 15 

Met Arg Phe Leu Thr Val Asp Val 'Gly Asn Ser Ser Val Asp He Ala 
15 10 15 

Leu Trp Glu Gly Lys Lys Val Lys Asp Phe Leu Lys Leu Ser His -Glu 
20 25 30 

Glu Phe Leu Lys Glu Glu Phe Pro Lys Leu Lys Ala Leu -Gly lie Ser . 
35 40 45 

Val Lys Gin Ser Phe Ser Glu Lys Val Arg Gly Lys lie Pro Lys He 
50 55 60 

Lys Phe Leu Lys Lys Glu Asn Phe Pro lie 'Gin Val Asp Tyr Lys Thr 
65 70 75 80 

Pro Glu Thr Leu Gly Thr Asp Arg Val Ala Leu Ala Tyr Ser Ala Lys 
85 90 95 

Lys Phe Tyr Gly Lys Asn Val Val Val He Ser Ala Gly Thr Ala Leu 
100 105 110 

Val He Asp Leu Val Leu Glu Gly Lys Phe Lys Gly .Gly Phe He Thr 
115 120 125 

Leu Gly Leu Gly Lys Lys Leu Lys He Leu Ser Asp Leu Ala Glu Gly 
130 135 140 

He Pro Glu Phe Phe Pro Glu Glu Val Glu lie Phe Leu ^Gly Arg 3er 
145 150 155 160 

Thr Arg Glu Cys Val Leu Gly <Gly Ala Tyr Arg £lu Ser Thr -Glu Phe 
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165 170 175 

He Lys Ser Thr Leu Lys Leu Trp Arg Lys Val Phe Lys Arg Lys Phe 
180 185 190 

Lys Val Val He Thr Gly Gly Glu Gly Lys Tyr Phe Ser Lys Phe Gly 
195 200 205 

He Tyr Asp Pro Leu Leu Val His Arg Gly Met Arg Ash Leu Leu Tyr 
210 215 220 

Leu Tyr His Arg He 
225 



<210> 16 
<211> 257 
<212> PRT 

<213> Synechocystis sp. ( 
<400> 16 

Met Glu Thr Ser Lys Pro Gly Cys Gly Leu Ala Leu Asp Asn Asp Lys 
15 10 15 

Gin Lys Pro Trp Leu Gly Leu Met He Gly Asn Ser Arg Leu His Trp 
20 25 30 

Ala Tyr Cys Ser Gly Ash Ala Pro Leu Gin Thr Trp Val Thr Asp Tyr 
35 40 45 

Asn Pro Lys Ser Ala Gin Leu Pro Val Leu Leu Gly Lys Val Pro Leu 
50 55 60 

Met Leu Ala Ser Val Val Pro Glu Gin Thr <Slu Val Trp Arg Val Tyr 
65 70 75 80 

Gin Pro Lys He Leu Thr Leu Lys Asn Leu Pro Leu Val Asn Leu Tyr 
85 90 95 

Pro Ser Phe Gly He Asp Arg Ala Leu Ala Gly Leu Gly Thr Gly Leu 
100 105 110 

Thr Tyr Gly Phe Pro Cys Leu Val Val Asp Gly Gly Thr Ala Leu Thr 
115 120 125 

He Thr Gly Phe Asp Gin Asp Lys Lys Leu Val Gly Gly Ala He Leu 
130 135 140 

Pro Gly Leu Gly Leu Gin Leu Ala Thr Leu Gly Asp Arg Leu Ala Ala 
145 150 155 * 160 

Leu Pro Lys Leu Glu Met Asp Gin Leu Thr Glu Leu Pro Asp Arg Trp 
165 170 175 

Ala Leu Asp Thr Pro Ser Ala He Phe Ser Gly Val Val Tyr Gly Val 
180 185 190 

Leu Gly Ala Leu Gin Ser Tyr Leu <31n Asp Trp Gin Lys Leu Phe Pro 
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195 200 • 205 

Gly Ala Ala Jiet Val lie Thr Gly Cly Asp -Gly Lys lie Leu His Gly 

210 215 220 

Phe Leu Lys Glu His Ser Pro Asn Leu Ser Val Ala Trp Asp Asp Asn 

225 230 235 240 

Leu He Phe Leu Gly Met Ala Ala lie His ttis Cly Asp Arg Pro He 

24 5 250 555 



Cys 



<210> 17 
<211> 223 
<212> PRT 

<213> Helicobacter pylori 

i 

<400> 17 

Met Pro Ala Arg 'Gin Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys 
1 5 10 15 

Asp He Gly Asn Thr Arg He His Phe Ala «Gin Asn Tyr 'Gin Leu Phe 
20 . -25 30 

Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu l Gly He Gin Lys Glu He 
35 40 4 5 

Phe Tyr He Ser Val Asn Glu -Glu Asn Glu Lys Ala Leu Leu Asn Cys 
50 55 €0 

Tyr Pro Asn Ala Lys Asn lie Ala Gly Phe Phe His Leu -Glu Thr Asp 
65 70 75 SO , 

Tyr Val Gly Leu Gly lie Asp Arg -Gin Met Ala Cys Leu Ala Val Asn 
85 90 95 

Asn Gly Val Val Val Asp Ala -Gly Ser Ala He Thr lie Asp Leu lie 
1-00 lt)5 HO 

Lys Glu Gly Lys His Leu Gly Gly Cys He Leu Pro -Gly Leu Ala .Gin 
115 120 125 

Tyr He His Ala Tyr Lys Lys Ser Ala Lys lie Leu Glu Gin Pro Phe 
130 135 140 

Lys Ala Leu Asp Ser -Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala 
14 5 150 155 160 

Val Asn Tyr Gly Met Val Leu "Ser Val He Ala Cys lie Gin His Leu 
165 170 175 

Ala Lys Asn Gin Lys lie Tyr Leu Cys Gly Cly Asp Ala Lys Tyr Leu 
160 185 190 

Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp 
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195 200 205 

Gly Met Glu lie Ala Leii Lys Lys Ala Gly He Leu Glu Cys Lys 
210 215 220 



<210> 18 
<211> 267 
<212> PRT 

<213> Bordetella pertussis 
<400> 18 

Met He He Leu He Asp Ser Gly Asn Ser Arg Leu Lys Val Gly Trp 
1 5 10 15 

Phe Asp Pro Asp Ala Pro Gin Ala Ala Arg Glu Pro Ala Pro Val Ala 
20 25 30 

Phe Asp Asn Leu Asp Leu Asp Ala Leu Gly Arg Trp Leu Ala Thr Leu 
35 40 45 

Pro Arg Arg Pro Gin Arg Ala Leu Gly Val Asn Val Ala Gly Leu Ala 
50 55 60 

Arg Gly Glu Ala He Ala Ala Thr Leu Arg Ala Gly Gly Cys Asp He 
65 70 75 ' " 80 

Arg Trp Leu Arg Ala Gin Pro Leu Ala Met Gly Leu Arg Asn Gly Tyr 
85 90 95 

Arg Asn Pro Asp Gin Leu Gly Ala Asp Arg Trp Ala Cys Met Val Gly 
100 105 HO 

Val Leu Ala Arg Gin Pro Ser Val His Pro Pro Leu Leu Val Ala Ser 
115 120 125 

Phe Gly Thr Ala Thr Thr Leu Asp Thr lie Gly Pro Asp Asn Val Phe 
130 135 140 

Pro Gly Gly Leu He Leu Pro Gly Pro Ala Met Met Arg Gly Ala Leu 
145 150 155 160 

Ala Tyr Gly Thr Ala His Leu Pro Leu Ala Asp Gly Leu Val Ala Asp 
165 170 175 

Tyr Pro He Asp Thr His Gin Ala He Ala Ser Gly He Ala Ala Ala 
180 185 190 

Gin Ala Gly Ala He Val Arg Gin Trp Leu Ala Gly Arg Gin Arg Tyr 
195 200 205 

Gly Gin Ala Pro Glu lie Tyr Val Ala Gly Gly Gly Trp Pro Glu Val 
210 215 220 

Arg Gin Glu Ala Glu Arg Leu Leu Ala Val Thr Gly Ala Ala Phe Gly 
225 230 235 240 

Ala Thr Pro -Gin Pro Thr Tyr Leu Asp Ser Pro Val Leu Asp Gly Leu 
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24 5 550 255 

Ala Ala Leu Ala Ala Gin Gly Ala Pro Thr Ala 
260 ' 265 



<210> 19 
<211> 777 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (774) 

<400> 19 

ttg tta ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 
Leu Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca age agg 
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg lie -Glu Thr Ser Arg 
20 25 30 

cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tec tta ttt gat 
His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 

cac tec ggg ett atg ttt gaa cag ata gat ggc att att att teg tea 
His Ser Gly Leu Met Phe Glu ^Gln lie Asp Gly lie lie lie Ser Ser 
50 ^5 €0 

gta gtg ecg cca ate atg ttt gcg tta gaa aga atg tgc aea aaa tac 
Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 60 

ttt cat ate gag cct caa att gtt ggt cca ggt atg aaa aec ggt tta 
Phe His He Glu Pro Gin He Val -Gly Pro 'Gly Met Lys Thr. Gly Leu 
85 90 95 



48 



96 



144 



192 



240 



288 



aat ata aaa tat gac aat ccg aaa gaa gta ggg gea gac aga ate gta 336 
Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 
100 105 HO 



aat get gtc get gcg ata cac ttg tae ggc aat oca tta att gtt gtc 
Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 



384 



gat ttc gga acc gee aca aeg tae tgc tat att gat gaa aac aaa caa 432 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 



tac atg ggc ggg gcg att gec cct ggg att aea att teg aca gag gcg 
Tyr Met Gly Gly Ala He Ala Pro -Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

ctt tac teg cgt gea gca aag ett oct cgt ato gaa ate acc ogg occ 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 
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165 170 175 

gac aat att ate gga aaa aac act gtt age gcg atg caa tct gga att 576 
Asp Asn lie lie Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly lie 
180 185 190 

tta ttt ggc tat gtc ggc caa gtg gaa gga ate gtt aag cga atg aaa 624 
Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 * 205 i 

tgg cag gea aaa cag gac etc aag gtc att gcg aca gga ggc ctg gcg £72 
Trp Gin Ala Lys Gin Asp Leu Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 220 

ecg etc att gcg aac gaa tea gat tgt ata gac ate gtt gat cca ttc • 720 
Pro Leu He Ala Asn Glu Ser Asp Cys He Asp He Val Asp Pro Phe 
225 230 235 ' 240 

tta ace eta aaa ggg ctg gaa ttg att tat gaa aga aac cgc gta gga 768 
Leu Thr Leu Lys Gly Leu Glu Leu He Tyr Glu Arg Asn Arg Val Gly 
245 250 255 

agt gta tag 777 
Ser Val 



<210> 20 
<211> 960 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (957) 

<400> 20 

gtg aaa aat aaa gaa ctt aac eta cat act tta tat aca cag cac aat ' 48 

Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gin His Asn 
1 5 10 15 

egg gag tct tgg tct ggt ttt ggg ggg cat ttg teg att get gta tct 96 
Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser He Ala Val Ser 
20 25 30 

gaa gaa gag gca aaa get gtg gaa gga ttg aat gat tat eta tct gtt 14 4 
Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp Tyr Leu Ser Val 
35 40 45 

gaa gaa gtg gag acg ate tat att ccg ctt gtt cgc ttg ctt cat tta 192 
Glu Glu Val Glu Thr He Tyr He Pro Leu Val Arg Leu Leu His Leu 
50 55 60 

cat gtc aag tct gcg get gaa cgc aat aag cat gtc aat gtt ttt ttg 240 
His Val Lys Ser Ala Ala Glu Arg Asn Lys His Val Asn Val Phe Leu 
65 70 75 80 

aag cac cca cat tea gee aaa att ccg ttt att ate ggc att gec ggc 288 
Lys His Pro His Ser Ala Lys He Pro Phe He He Gly He Ala Gly 
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85 , 90 95 

agt gtc gca gtc gga aaa age acg acg gcg egg ate ttg cag aag ctg 
Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg lie Leu Gin Lys Leu 
100 105 HO 

ctt teg egt ttg cct gac egt cca aaa gtg age ctt ate- acg aea gat 

Leu Ser Arg Leu Pro Asp Arg Pro Lys Val Ser Leu ILe Thr Thr Asp 
115 120 125 

* 

ggt ttt tta ttt cct act gee gag ctg aaa aag aaa aat atg atg tea 

Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys Lys Lys Asn Met Met Ser 
130 135 140 

aga aaa gga ttt cct gaa age tat gat gta aag gcg »ctg <:tc gaa ttt 
Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe 
145 J 150 155 ( 160 

ttg aat gac tta aaa tea gga aag gac age gta aag gee ccg gtg tat 
Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr 
165 170 175 



336 



384 



432 



480 



528 



tec cat eta acc tat gac cgc gag gaa ggt gtg tte gag gtt gta gaa 576 
Ser His Leu Thr Tyr Asp Arg Glu -Glu Gly Val Phe -Glu Val Val Glu 
180 185 190 



cag gcg gat att gtg att att gaa gge att aat gtt ctt cag teg cec 
Gin Ala Asp He Val He He Glu Gly lie Asn Val Leu Gin ^Ser Pro 
195 200 205 



ttt gat ttt teg att tat gtg gat gcg gag gaa age egg att tte act 
Phe Asp Phe Ser He Tyr Val Asp Ala Glu Glu Ser Arg He Phe Thr 
225 * 230 235 240 

tgg tat tta gag egt ttt cgc ctg ctt .egg gaa aea get ttt caa aat 
Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn 
24 5 250 255 

cct gat tea tat ttt cat aaa ttt aaa gac ttg tec .gat <:ag gag get 
Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp *Gln Glu Ala 
260 2-65 270 

gac gag atg gca gee teg att tgg gag agt gtc aac egg ccg aat tta 
Asp Glu Met Ala Ala Ser He Trp Glu "Ser Val Asn Arg -Pro Asn Leu 
275 280 285 

tat gaa aat att ttg cca act aaa tte agg tea -gat etc att ttg .egt 
Tyr Glu Asn He Leu Pro Thr Lys Phe Arg Ser Asp Leu He Leu Arg 
290 295 300 

aag gga gac ggg cat aag gtc gag gaa gtg ttg gta agg agg gta tga 
Lys Gly Asp Gly His Lys Val Glu Glu Val Leu Val Arg Arg Val 
305 ^ 310 315 



€24 



acc ttg gag gat gac egg gaa aac ccg egt att ttt gtt tec gat tte 672 
Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg He Phe Val Ser Asp Phe 
210 215 2-20 



720 



7-68 



81«6 



8«64 



912 



960 
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<210> 21 
<211> 882 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(879) 

<400> 21 

ttg teg att get gta tct gaa gaa gag gca aaa get gtg gaa gga ttg 4 8 

Met Ser He Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu 

1 5 10 15 

aat gat tat eta tct gtt gaa gaa gtg gag acg ate tat att ccg ctt 96 
Asn Asp Tyr Leu Ser Val Glu Glu Val <31u Thr He Tyr He Pro Leu 
20 25 30 

gtt cgc ttg ctt cat tta cat gtc aag tct gcg get gaa cgc aat aag 144 
Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys 
35 40 > 45 

cat gtc aat gtt ttt ttg aag cac cca cat tea gee aaa att ccg ttt 192 
His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys He Pro Phe 
50 55 60 

att ate ggc att gec ggc agt gtc gca gtc gga aaa age acg acg gcg 240 
lie He Gly He Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala 
65 70 75 80 

egg ate ttg cag aag ctg ctt teg cgt ttg cct gac cgt cca aaa gtg 288 
Arg He Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val 
85 90 95 

age ctt ate acg aca gat ggt ttt tta ttt cct act gee gag ctg aaa 336 
Ser Leu He Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys ' 
100 105 HO 

aag aaa aat atg atg tea aga aaa gga ttt cct gaa age tat gat gta 384 
Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro €lu Ser Tyr Asp Val 
115 120 125 

aag gcg ctg etc gaa ttt ttg aat gac tta aaa tea gga aag gac age 4 32 
Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser 
130 135 140 

gta aag gee ccg gtg tat tec cat eta ace tat gac cgc gag gaa ggt 480 
Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly 
145 150 155 160 

gtg ttc gag gtt gta gaa cag gcg gat att gtg att att gaa ggc att 528 
Val Phe Glu Val Val Glu Gin Ala Asp He Val He He Glu Gly He 
165 170 175 

aat gtt ctt cag teg ccc ace ttg gag gat gac egg gaa aac ccg cgt 576 
Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg 
180 185 190 
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act ttt gtt tec gat ttc ttt gat ttt tc<g att tat gtg gat, gcg gag 624 
lie Phe Val Ser Asp Phe Phe Asp Phe Ser lie Tyr Val Asp Ala <51u 
195 200 205 

gaa age egg att ttc act tgg tat tta gag cgt ttt egc ^ctg ctt egg 672 
Glu Ser Arg lie Phe Thr Trp Tyr Leu Glu Arg Phe Arg. Leu Leu Arg 
210 * 215 220 



gaa aca get ttt caa aat. cet gat tea tat ttt eat aaa ttt aaa gae 
Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp 
225 230 235 240 



720 



ttg tec gat cag gag get gac gag atg ^gca gee teg att tgg gag agt 768 
Leu Ser Asp Gin Glu Ala Asp -Glu Met Ala Ala Ser lie Trp Glu Ser 
245 • 250 255 

gtc aac egg ccg aat tta tat gaa aat att ttg eca adt aaa ttc a<gg Bl€ 
Val Asn Arg Pro Asn Leu Tyr Glu Asn He Leu Pro Thr Lys Ph« Arg 
260 265 270 

tea gat etc att ttg cgt aag gga gac ggg eat aag gtc gag gaa gtg 864 
Ser Asp Leu He Leu Arg Lys Gly Asp Gly -His Lys Val Glu Glu Val 
27 5 280 285 



ttg gta agg agg gta tga> 
Leu Val Arg Arg Val 
290 



882 



<210> 22 
<211> 846 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(843) 

<400> 22 

gtg gaa gga ttg aat gat tat eta tct gtt gaa ^gaa ^gtg gag acg ate 48 

Met Glu Gly Leu Asn Asp Tyr Leu Ser Val Glu "Glu Val «Glu Thr lie 
15 10 15 

tat att ccg ctt gtt cgc ttg ctt eat tta eat ^te -aag tct gcg get 96 
Tyr He Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala 
20 25 30 

gaa cgc aat aag cat gtc aat gtt ttt ttg aag eac -eca eat tea gee 144 
Glu Arg Asn Lys His Val Asn Val Phe Leu Lys His Pro His Ser Ala 
35 .40 45 

aaa att ccg ttt att ate ggc att gee ggc agt gtc gca gtc <gga aaa 192 
Lys He Pro Phe He He Gly He Ala <31y Ser Val Ala Val Gly Lys 
50 55 60 

age acg acg gcg egg ate ttg cag aag ctg ctt teg «cgt ttg ect gac 240 
Ser Thr Thr Ala Arg He Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp 
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65 70 75 80 

cgt cca aaa gtg age ctt ate acg aca gat <ggt ttt tta ttt cct act 288 
Arg Pro Lys Val Ser Leu lie Thr Thr Asp Gly Phe Leu Phe Pro Thr 
85 90 95 

gee gag ctg aaa aag aaa aat atg atg tea aga aaa gga ttt cct gaa 336 
Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu 
100 105 110 

age tat gat gta aag gcg ctg etc gaa ttt ttg aat gac tta aaa tea 384 
Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser 
115 120 125 

gga aag gac age gta aag gec ccg gtg tat tec cat eta acc tat gac 4 32 
Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp 
130 135 140 

cgc gag gaa ggt gtg ttc gag gtt gta gaa cag gcg gat att gtg att 480 
Arg Glu Glu Gly Val Phe Glu Val Val Glu Gin Ala Asp He Val He 
145 150 155 160 

att gaa ggc att aat gtt ctt cag teg ccc acc ttg gag gat gac egg 528 
He Glu Gly He Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg 
165 170 175 

gaa aac ccg cgt att ttt gtt tec gat ttc ttt gat ttt teg att tat 576 
Glu Asn Pro Arg He Phe Val Ser Asp Phe Phe Asp Phe Ser He Tyr 
180 185 190 

gtg gat gcg gag gaa age egg att ttc act tgg tat tta gag cgt ttt 624 
Val Asp Ala Glu Glu Ser Arg He Phe Thr Trp Tyr Leu Glu Arg Phe 
195 200 205 

cgc ctg ctt egg gaa aca get ttt caa aat cct gat tea tat ttt cat 672 
Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His 
210 215 220 

aaa ttt aaa gac ttg tec gat cag gag get gac gag atg gca gee teg 720 
Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser 
225 230 235 240 

att tgg gag agt gtc aac egg ccg aat tta tat gaa aat att ttg cca 768 
He Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn He Leu Pro 
245 250 255 

act aaa ttc agg tea gat etc att ttg cgt aag gga gac ggg cat aag 816 
Thr Lys Phe Arg Ser Asp Leu lie Leu Arg Lys Gly Asp Gly His Lys 
260 265 270 

gtc gag gaa gtg ttg gta agg agg gta tga 846 
Val Glu Glu Val Leu Val Arg Arg Val 
275 280 



<210> 23 
<211> 831 
<212> DNA 
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<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (831) 

<4O0> 23 

atg aaa aca aaa ctg gat ttt eta aaa atg aag gag tct gaa gaa ccg 48 
Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys -Glu Ser <51u <Uu Pro 
1 5 10 15 



att gtc atg -ctg acc get tat gat tat ccg gca get aaa <:tt get gaa 96 
lie Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala <51u 
20 25 30 



caa gcg gga gtt gac atg att tta gte ggt gat tea <:tt gga atg gtc 144 
Gin Ala Gly Val Asp Met He Leu Val Gly Asp Ser Leu Gly Wet Val 
35 40 ^5 



gtc etc ggc ctt gat tea act gtc ggt gtg aca gtt gcg gac atg ate 
Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met He 
50 55 60 . 



aca gat atg ccg ttt atg tct tat cae ctg tct aag gaa gat aeg -ctg 
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 
85 90 95 



ctt gag ggc gga gaa ggc gtg ttt gaa *ce att egc gca ttg aeg -ctt 
Leu Glu Gly Gly Glu Gly Val Phe Glu Ser He Arg Ala Leu Thr Leu 
115 120 • 125 

gga ggc att cca gta gtc agt cae tta ggt ttg aca ccg eag tea gtc 
Gly Gly He Pro Val Val Ser His Leu <Sly -Leu Thr Pro <31n Ser Val 
130 135 140 

ggc gta ctg ggc ggc tat aaa gta cag ggc aaa gac gaa eaa age gee 
Gly Val Leu Gly Gly Tyr Lys Val Gin Gly Lys Asp Glu <31n *Ser Ala 
14 5 150 155 1-60 

aaa aaa tta ata gaa gac agt ata aaa tgc gaa gaa gca gga get ato, 
Lys Lys Leu lie Glu Asp Ser He Lys «Cys Glu 131u Ala XUy Ala Wet 
165 170 175 

atg ctt gtg ctg gaa tgt gtg ecg gca gaa etc aca jgee aaa att gee 
Met Leu Val Leu Glu Cys Val Pro Ala XUu Leu Thr Ala -Lys lie Ala 
180 185 190 

gag aeg eta age ata ccg gtc att gga ate ggg get ggt <gtg aaa gcg 
Glu Thr Leu Ser He Pro Val He Gly He Gly Ala <31y Val Lys Ala 
195 200 205 



192 



cat cat aca aaa gec gtt aaa agg ggt gcg ecg aat acc ttt att *gt^ 240 
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe tie Val 
65 70 • 75 80 



288 



aaa aat gca gcg get ate gtt eag gaa age gga get gac gca ctg aag 336 
Lys Asn Ala Ala Ala He Val Gin Glu Ser Gly Ala Asp Ala Leu Lys 
100 105 HO 



384 



432 



480 



528 
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gac gga caa gtt etc gtt tat cat gat att ate ggc cac ggt gtt gag 672 
Asp Gly Gin Val Leu Val Tyr His Asp lie lie Gly His Gly Val Glu 
210 215 220 

aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa acc ate 720 
Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg lie Asp Glu Thr He 
225 230 235 240 

gaa aca gca ate age gga tat gtt cag gat gta aga cat cgt get ttc 768 
Glu Thr Ala He Ser Gly Tyr Val Gin Asp Val Arg His Arg Ala Phe 
245 250 255 

cct gaa caa aag cat tec ttt caa atg aac cag aca gtg ctt gac ggc 816 
Pro Glu Gin Lys His Ser Phe Gin Met Asn Gin Thr Val Leu Asp Gly 
260 265 270 

ttg tac ggg gga aaa 831 
Leu Tyr Gly Gly Lys 
275 



<210> 24 
<211> 277 
<212> PRT 

<213> Bacillus subtilis 
<400> 24 

Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 
1 5 10 15 

He Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 
20 25 30 

Gin Ala Gly Val Asp Met He Leu Val Gly Asp Ser Leu Gly Met Val 
35 40 45 

Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met He 
50 55 60 

His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe He Val 
65 70 75 80 

Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 
85 90 95 

Lys Asn Ala Ala Ala He Val. Gin Glu Ser Gly Ala Asp Ala Leu Lys 
100 105 ' 110 

Leu Glu Gly Gly Glu Gly Val Phe Glu Ser He Arg Ala Leu Thr Leu 
115 120 125 

Gly Gly He Pro Val Val Ser His Leu Gly Leu Thr Pro Gin Ser Val 
130 135 140 

Gly Val Leu Gly Gly Tyr Lys Val Gin Gly Lys Asp Glu Gin Ser Ala 
145 150 155 160 
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Lys Lys Leu lie Glu Asp Ser lie Lys Cys Glu Glu Ala Gly Ala Met 
165 170 175 

Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys lie Ala 
180 185 190 

Glu Thr Leu Ser He Pro Val lie Gly lie Gly Ala Gly Val, Lys Ala 
195 200 205 

Asp Gly Gin Val Leu Val Tyr His Asp He lie £ly His <31y Val Glu 
210 215 220 

Arg Thr Pro Lys Phe Val Lys -Gin Tyr Thr Arg lie Asp -Glu Thr He 
225 ' 230 235 240 

Glu Thr Ala lie Ser Gly Tyr Val Gin Asp Val Arg His Arg Ala Phe 
24 5 250 255 

Pro Glu Gin Lys His Ser Phe Gin Met Asn Gin Thr Val Leu Asp Gly 
260 265 . 270 

Leu Tyr Gly Gly Lys 
275 



<210> 25 
<211> 858 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (858) 

<400> 25 

atg aga cag att act gat att tea cag ctg aaa gaa gec ata aaa.caa 

Met Arg Gin He Thr Asp He Ser Gin Leu Lys Glu Ala He Lys -Gin 

1 5 10 15 

tac cat tea gag ggc aag tea ate -gga ttt gtt eeg aeg atg ggg ttt 
Tyr His Ser Glu Gly Lys Ser He Gly Phe Val Pro Thr Met Gly -Phe 
20 25 30 

ctg cat gag ggg cat tta ace tta gca gae aaa gca aga eaa gaa aac 
Leu His Glu Gly His Leu Thr Leu Ala Asp Lys Ala Arg -Gin Glu Asn 
35 40 45 

gac <;cc gtt att atg agt att ttt gtg aat cet gca eaa ttc ggc cet 
Asp Ala Val lie Met Ser lie Phe Val Asn Pro Ala XSln Phe Gly Pro 
50 55 60 

aat gaa gat ttt gaa gca tat ccg egc gat att gag egg gat gca get 
Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp He Glu Arg Asp Ala Ala 
65 70 75 80 

ctt gca ^aa aac gee gga gtc gat att ett ttt aeg eea gat get eat 
Leu Ala Glu Asn Ala Gly Val Asp He Leu Phe Thr Pro Asp Ala His 
85 90 95 



48 



96 



144 



192 



240 



288 
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gat atg tat occ ggt gaa aag aat gtc acg att cat gta gaa aga cgc 336 

Asp Met Tyr Pro Gly Glu Lys Asn Val Thr He His Val Glu Arg Arg 
100 105 110 

aca gac gtg tta tgc ggg cgc tea aga gaa gga cat ttt gac ggg gtc 384 

Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val 
115 120 125 

gcg ate gta ctg acg aag ett ttc aat eta gtc aag ccg act cgt gee 4 32 

Ala He Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala 
130 135 140 

tat ttc ggt tta aaa gat gcg cag eag gta get gtt gtt gat ggg tta 4 80 

Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val Ala Val Val Asp Gly Leu 

145 150 155 160 

ate age gac ttc ttc atg gat att gaa ttg gtt cct gtc gat acg gtc 528 

He Ser Asp Phe Phe Met Asp He Glu Leu Val Pro Val Asp Thr Val 

165 170 ' 175 

aga gag gaa gac ggc tta gec aaa age tct cgc aat gta tac tta aca 57 6 

Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr 
180 185 190 

get gag gaa aga aaa gaa gcg cct aag ctg tat egg gee ctt caa aca 624 

Ala Slu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gin Thr 
195 - 200 205 

agt gcg gaa ctt gtc eaa gee ggt gaa aga gat cct gaa gcg gtg ata 672 

Ser Ala Glu Leu Val <Un Ala Gly Glu Arg Asp Pro Glu Ala Val He 
210 215 220 

aaa get gca aaa gat ate att gaa acg act age gga ace ata gac tat 720 

Lys Ala Ala Lys Asp He lie Glu Thr Thr Ser Gly Thr He Asp Tyr 

225 230 235 240 

gta gag ctt tat tec tat ccg gaa etc gag cct gtg aat gaa att get 768 

Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu He Ala 

245 . 250 255 

gga aag atg att cte get gtt gca gtt get ttt tea aaa gcg cgt tta 816 

Gly Lys Met He Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu 
260 265 270 

ata gat aat ate att att gat att cga gaa atg gag aga ata 858 
He Asp Asn He He He Asp He Arg Glu Met Glu Arg He 
275 280 285 



<210> 26 
<211> 286 
<212> PRT 

<213> Bacillus subtilis 
<400> 2€ 

Met Arg Gin He Thr Asp lie Ser Gin Leu Lys Glu Ala lie Lys Gin 
15 io 15 
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Tyr His Ser Glu Gly Lys Ser He Gly Phe Val Pro Thr Met Gly Phe 
20 25 30 

' Leu His Glu Gly His Leu Thr Leu Ala Asp iys Ala Arg Gin Glu Asn 
35 40 45 

Asp Ala Val He Met Ser He Phe Val Asn.Pro Ala -Gin Phe Gly Pro 
50 55 60 

Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp lie *Glu Arg Asp Ala Ala 
65 70 75 80 

Leu Ala Glu Asn Ala Gly Val Asp He Leu Phe Thr Pro Asp Ala His 
85 90 95 

Asp Met Tyr Pro Gly Glu Lys Asn Val Thr He Hi-s Val Glu Arg Arg 
100 105 HO 

Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His ,Phe Asp Gly Val 
115 ^ 12() 125 

Ala lie Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala 
130 135 140 

Tyr Phe Gly Leu Lys Asp, Ala G.ln r Gln Val Ala Val Val Asp -Gly Leu 
145 150 155 160 

He Ser Asp Phe Phe Met Asp He Glu Leu Val Pro Val Asp Thr Val 
165 170 175 

Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr 
180 185 190 

Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gin Thr 
195 200 205 

Ser Ala Glu Leu Val Gin Ala Gly Glu Arg Asp -Pro Glu Ala Val lie 
210 215 220 

Lys Ala Ala Lys Asp He He *Glu Thr Thr Ser Gly Thr He Asp Tyr 
225 230 235 240 

Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn -Glu He Ala 
24 5 250 255 

Gly Lys Met He Leu Ala Val Ala Val Ala Phe Ser <Lys Ala Arg Leu 
2-60 265 270 

He Asp Asn He He He Asp He Arg -Glu Met Glu Arg He 
275 280 285 



<210> 27 
<211> 381 
<212> DNA 

<213> Bacillus subtilis 



GNsnrr.ir* <wo 
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<220> 

<221> CDS 

<222> (1) . . (381) 



<400> 27 

atg tat cga aca atg atg age ggc aaa ctt cac agg gca act gtt acg ,48 
Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr 
1 5 10 15 

gaa gca aac ctg aac tat gtg gga age att aca att gat gaa gat etc 96 
Glu Ala Asn Leu Asn Tyr Val Gly Ser lie Thr lie Asp Glu Asp Leu 
20 25 30 

att gat get gtg gga atg ctt cct aat gaa aaa gta caa att gtg aat 144 
lie Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gin lie Val Asn 
35 40 45 

aat aat aat gga gca cgt ctt gaa acg tat att att cct ggt aaa egg 192 
Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr lie He Pro Gly Lys Arg 
50 55 60 * 



gga age ggc gtc ata tgc tta aac ggt gca gee gca cgc ctt gtg cag 240 
Gly Ser Gly Val He Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gin 
65 70 75 " 80 

gaa gga gat aag gtc att att att tec tac aaa atg atg tct gat caa 288 
Glu Gly Asp Lys Val He He lie Ser Tyr Lys Met Met Ser Asp Gin 
85 90 95 

gaa gcg gca age cat gag ccg aaa gtg get gtt ctg aat gat caa aac 336 
Glu Ala Ala Ser His Glu Pro Lys Val Ala Val Leu Asn Asp <31n Asn 
100 105 110 

aaa att gaa caa atg ctg ggg aac gaa cca gee cgt aca att ttg 381 
Lys lie Glu Gin Met Leu Gly Asn Glu Pro Ala Arg Thr "lie Leu 
115 120 125 



<210> 28 
<211> 127 
<212> PRT 

<213> Bacillus subtilis 



<400> 28 

Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr 
15 10 15 

Glu Ala Asn Leu Asn Tyr Val Gly Ser lie Thr lie Asp Glu Asp Leu 
20 25 30 

lie Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gin lie Val Asn 
35 40 45 

Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr lie lie Pro Gly Lys Arg 
50 55 60 

Gly Ser Gly Val lie Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gin 
65 70 75 80 



WO 01/21772 PCT/US00/25993 

-32- 



Glu Gly Asp Lys Val He He lie Ser Tyr Lys **et Met Ser Asp <Sln 
85 90 95 

Glu Ala Ala Ser His «Glu Pro iys Val Ala Val -Leu Asn Asp -Gin Asn 
100 105 HO 

Lys lie Glu -Gin Met Leu Gly Asn Glu Pro. Ala Arg Thr lie Leu 
115 120 125 



<210> 29 
<211> 894 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (894) 

<400> 29 

atg aaa att gga att ate -ggc gga ggc tec -gtt ggt ctt tta tgc gec 
Met Lys He Gly He He -Gly <Gly Gly Ser Val <Uy Leu Leu Cys Ala 
15 10 W 

tat tat ttg tea ctt tat cac gac gtg act gtt gtg acg agg egg caa 
Tyr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg "Glfi 
20 25 30 

gaa cag get gcg gee att cag tct gaa gga ate egg ctt tat aaa ggc 
Glu Gin Ala Ala Ala He -Gin Ser -Glu -Gly He Arg Leu Tyr Lys -Gly 
35 40 45 

ggg gag gaa ttc agg get gat tgc agt gcg gac acg agt ate aat teg 
Giy Glu <Uu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser lie Asn Ser 
50 55 60 

gac ttt gac ctg ctt gtc gtg aca gtg aag cag cat -cag ctt caa tct 
Asp Phe Asp Leu Leu Val Val Thr Val Lys -Gin His Gin Leu <3in Ser 
65 70 75 80 

gtt ttt teg teg ctt gaa cga ate ggg aag acg aat ata tta ttt ttg 
Val Phe Ser Ser Leu Glu Arg He Gly Lys Thr Asn He Leu Phe Leu 
85 90 95 

caa aac ggc atg ggg cat ate cac gac eta aaa gac tgg cac gtt -ggc 
Gin Asn Gly Met <31y His He His Asp Leu Lys Asp Trp His Val «Gly 
100 105 liO 

cat tec att tat gtt gga ate gtt gag cac gga get gta aga aaa teg 
His Ser He Tyr Val <31y He Val <Slu His <Sly Ala Val Arg Lys Ser 
115 120 125 



43 



96 



1-4 4 



192 



240 



238 



336 



384 



gat aca get gtt gat cat aca ggc eta ggt gcg ata aaa tgg age gcg 4 32 
Asp Thr Ala Val Asp His Thr Gly Leu Gly Ala He Lys Trp Ser Ala 
130 135 140 



ttc gac gat get gaa cca gac egg ctg aac ate ttg ttt cag cat aac 



480 
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Phe Asp Asp Ala Glu Pro Asp Arg Leu Asn He Leu Phe Gin His Asn* 
145 150 155 160 

cat teg gat ttt ccg att tat tat gag acg gat tgg tac cgt ctg ctg 528 
His Ser Asp Phe Pro He Tyr Tyr Glu Thr Asp Trp Tyr Arg Leu Leu 
165 170 175 

acg ggc aag ctg att gta aat gcg tgt att aat cct tta act gcg tta 576 
Thr Gly Lys Leu He Val Asn Ala Cys He Asn Pro Leu Thr Ala Leu 
180 185 190 

ttg caa gtg aaa aat gga gaa ctg ctg aca acg cca get tat ctg get 624 
Leu Gin Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala 
195 200 205 

ttt atg aag ctg gta ttt cag gag gca tgc cgc att tta aaa ctt gaa 672 

Phe Met Lys Leu Val Phe Gin Glu Ala Cys Arg He Leu Lys Leu Glu 
210 215 220 

i 

aat gaa gaa aag get tgg gag egg gtt cag gec gtt tgt ggg caa acg 720 

Asn Glu Glu Lys Ala Trp Glu Arg Val -Gin Ala Val Cys Gly Gin Thr 
225 230 235 * 240 

aaa gag aat cgt tea tea atg ctg gtt gac gtc att gga ggc egg cag 768 
Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val He Gly Gly Arg Gin 
245 250 255 

acg gaa get gac gee att ate gga tac tta ttg aag gaa gca agt ctt 816 
Thr Glu Ala Asp Ala He He Gly Tyr Leu Leu Lys Glu Ala Ser Leu 
260 265 270 

caa ggt ctt gat gee gtc cac eta gag ttt tta tat ggc age ate aaa 864 
Gin Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr -Gly Ser He Lys 
275 280 285 

gca ttg gag cga aat aca aac aaa gtc ttt 894 
Ala Leu Glu Arg Asn Thr Asn Lys Val Phe 
290 295 



<210> 30 
<211> 298 
<212> PRT 

<213> Bacillus subtilis 
<400> 30 

Met Lys He Gly He He Gly Gly Gly Ser Val Gly Leu Leu Cys Ala 
1 5 10 15 

Tyr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg Gin 
20 25 30 

Glu Gin Ala Ala Ala He Gin Ser Glu Gly He Arg Leu Tyr Lys Gly 
35 40 45 

Gly Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser lie Asn Ser 
50 55 60 
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Asp Phe Asp Leu Leu Val Val Thr Val Lys <51n Hi's Gin Leu <31n Ser 

65 70 75 *0 

Val Phe Ser Ser Leu £lu Arg lie Gly Lys Thr Asn lie Leu Phe Leu 
• 85 90 95 

Gin Asn Gly Met Gly His He His Asp Leu Lys Asp Trp Ms Val <Gly 
100 105 110 

His Ser He Tyr Val Gly He Val Glu His Gly Ala Val Arg Lys Ser 
115 120 125 

Asp Thr Ala Val Asp His Thr -Gly Leu Gly Ala He Lys Tr-p Ser Ala 
130 135 * 140 

Phe Asp Asp Ala Glu Pro Asp Arg Leu Asn He Leu Phe -Gin His Asn 
145 150 155 1*0 

His Ser Asp Phe Pro He Tyr Tyr Glu Thr Asp Trp Tyr Arg Leu Leu 
165 170 , 175 

' Thr Gly Lys Leu He Val Asn Ala Cys He Asn Pro Leu Thr Ala Leu 
180 185 WO 

Leu Gin Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala 
195 » 20fl • 2 *> 5 

Phe Met Lys Leu Val Phe Gin Glu Ala Cys Arg He Leu Lys Leu .Glu 
210 215 . 220 

Asn Glu Glu Lys Ala Trp Glu Arg Val <Gln Ala Val Cys .Gly Gin Thr 
225 230 235 240 

Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val He *Gly Gly Arg Gin 
245 250 255 

Thr Glu Ala Asp Ala lie He Gly Tyr Leu Leu Lys Glu Ala Ser Leu 
260 265 270 

Gin Gly Leu Asp Ala Val His Leu 'Glu Phe Leu Tyr Gly Ser He Lys 
275 280 285 

Ala Leu Glu Arg Asn Thr Asn Lys Val Phe 
290 295 

<210> 31 
<211> 1725 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> <1> . - (1722) 

<400> 31 

atg ggg act aat gta cag gtg gat tea gca tct gec -gaa tgt aca cag 4 8 
Met Gly Thr Asn Val Gin Val Asp Ser Ala Ser Ala Glu Cys Thr Gin 
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1 5 10 15 

acg atg age gga gca tta atg ctg att gaa tea tta aaa aaa gag aaa 96 
Thr Met Ser Gly Ala Leu Met Leu He Glu Ser Leu Lys Lys Glu Lys 
20 ' 25 30 

gta gaa atg ate ttc ggt tat ccg ggc ggg get gtg ctt ceg att tac 144 
Val Glu Met He Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro He Tyr 
35 40 45 

gat aag eta tac aat tea ggg ttg gta cat ate ctt ccc cgt cac gaa 192 
Asp Lys Leu Tyr Asn Ser Gly Leu Val His He Leu Pro Arg His Glu 
50 55 60 

caa gga gca att cat gca gcg gag gga tac gca agg gtc tec gga aaa 240 
Gin Gly Ala He His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys 
65 70 75 80 

ccg ggt gtc gtc att gec acg tea ggg ccg gga gcg afca aac ctt gtt 288 
Pro Gly Val Val He Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 
85 90 95 

aca ggc ctt get gat gee atg att gat tea ttg ccg tta gtc gtc ttt 336 
Thr Gly Leu Ala Asp Ala Met He Asp Ser Leu Pro Leu Val Val Phe 
100 105 110 

aca ggg cag gta gca acc tct gta ate ggg age gat gea ttt cag gaa 384 
Thr Gly Gin Val Ala Thr Ser Val He Gly Ser Asp Ala Phe Gin Glu 
115 120 125 

gca gac att tta ggg att acg atg cca gta aca aaa cac age tac cag 4 32 
Ala Asp He Leu Gly He Thr Met Pro Val Thr Lys His Ser Tyr Gin 
130 135 140 

gtt cgc cag ccg gaa gat ctg ccg cgc ate att aaa gaa gcg ttc cat 480 
Val Arg Gin Pro Glu Asp Leu Pro Arg He He Lys Glu Ala Phe His 
145 150 155 160 

att gca aca act gga aga ccc gga cct gta ttg att gat att ccg aaa 528 
He Ala Thr Thr Gly Arg Pro Gly Pro Val Leu He Asp He Pro Lys 
165 170 175 

gat gta gca aca att gaa gga gaa ttc age tac gat cat gag atg aat 576 
Asp Val Ala Thr He Glu Gly Glu Phe Ser Tyr Asp His Glu Met Asn 
180 185 190 

etc ccg gga tac cag ccg aca aca gag ccg aat tat ttg cag ate cgc 624 
Leu Pro tSly Tyr Gin Pro Thr Thr Glu Pro Asn Tyr Leu Gin He Arg 
195 200 205 

aag ctt gtg gaa gee gtg age agt gcg aaa aaa ccg gtg ate ctg gcg 672 
Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val He Leu Ala 
210 215 220 

ggt gcg ggc gta ctg cac gga aaa geg tea gaa gaa tta aaa aat tat 720 
Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr 
225 230 235 240 
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get cjaa cag cag caa ate cct gtg gca cac acc ctt ttg ggg etc gga 7*8 
Ala -Glu Gin Gin Gin He Pro Val Ala -His Thr Leu Leu Gly Leu Gly 
24 5 250 £55 

ggc ttc ccg get -gac cat ccg ctt ttc eta -ggg atg gcg gga atg cac 8*1-* 
Gly Phe Pro Ala Asp His Pro Leu Phe Leu *Gly Met Ala Gly Met His 
260 265 . 270 



ggt act tat aca gec aat atg gee ctt cat gaa tgt gat eta tta ate 

Gly Thr Tyr Thr Ala Asn Met Ala Leu His <51u Cys Asp Leu Leu lie 

275 280 285 

agt ate ggc gec cgt ttt gat -gac cgt gtc aca gga aac ctg aaa cac 

Ser He Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His 

290 295 300 

ttt gec aga aac gca aag ata gec -cac ate gat att gat cca get gaa 
Phe Ala Arg Asn Ala Lys He Ala His He Asp He Asp Pro Ala -Glu 

305 310 315 320 



gaa 



gta ggc cag cat caa atg tgg tea gcg caa ttt tat ccg ttc -caa aaa 
Val Gly Gin His Gin Met Trp *Ser Ala Gin Phe Tyr Pro -Phe Gin -Lys 
405 410 415 



gtc gcg gtt gtc gga gac ggc gga ttc caa atg acg ctt caa -gaa etc 

Val Ala Val Val Gly Asp Gly Gly Phe Gin Met Thr Leu Gin Glu Leu 
450 4 55 4*0 

gat gtt att cgc gaa tta aat ctt ccg gtc aag gta gtg att tta aat 

Asp Val He Arg Glu Leu Asn Leu Pro Val Lys Val Val He Leu Asn 



8*4 



913 



960 



ate gga aaa ate atg aaa aca cag att cct gta gtc g^a gac age aaa 1008 
He Gly Lys He Met Lys Thr Gin He -Pro Val Val -Gly Asp Ser Lys 
325 330 335 

att gtc ctg cag gag ctg ate aaa caa gac ggc aaa caa age gat tea 105* 
He Val Leu Gin Glu Leu . He Lys Gin Asp *Gly Lys -Gin Ser Asp Ser 
340 345 350 

age gaa tgg aaa aaa cag etc gca gaa tgg aaa cjaa gag tat ccg -etc 1104 
Ser Glu Trp Lys Lys Gin Leu Ala Glu Trp Lys Glu Glu Tyr Pro Leu 
355 360 365 

tgg tat gta gat aat gaa gaa gaa ggt ttt aaa cct cag aaa ttg att 1152 
Trp Tyr Val Asp Asn Glu Glu *Glu Gly Phe Lys Pro Gin Lys Leu He 
370 375 380 



tat att cat caa ttt aca aaa gga gag gec att gtc gca acg gat 1200 
Glu Tyr He His Gin Phe Thr Lys Gly Glu Ala He Val Ala Thr Asp 
385 390 395 400 



1248 



gca gat aaa tgg gtc acg tea ggc gga ctt gga acg atg gga ttc ggt 129* 
Ala Asp Lys Trp Val Thr Ser Gly -Gly Leu Gly Thr Met Gly Phe Gly 
420 425 430 

ctt ccg gcg gcg ate ggc gca cag ctg gee gaa aaa gat get act -gtt 134 4 
Leu Pro Ala Ala He Gly Ala Gin Leu Ala Glu Lys Asp Ala Thr Val 
435 440 445 



1392 



1440 
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465 470 475 480 

aac get tgt etc gga atg gtc aga cag tgg cag gaa att ttc tat gaa 1488 
Asn Ala Cys Leu Gly Met Val Arg Gin Trp Gin Glu He Phe Tyr Glu 
485 490 495 

gaa cgt tat tea gaa tct aaa ttc get tct cag cct gac ttc gtc aaa 1536 
Glu Arg Tyr Ser Glu Ser Lys Phe Ala Ser Gin Pro Asp Phe Val Lys 
500 505 510 

ttg tec gaa gca tac ggc att aaa ggc ate aga att tea tea gaa gcg 1584 
Leu Ser Glu Ala Tyr Gly He Lys Gly He Arg He Ser Ser Glu Ala 
515 520 525 

gaa gca aag gaa aag ctg gaa gag gca tta aca tea aga gaa cct gtt 1632 
Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val 
530 535 540 

gtc att gac gtg egg gtt gec age gaa gaa aaa gta ttc ccg atg gtg 1680 
Val He Asp Val Arg Val Ala Ser Glu Glu Lys Val Phe Pro Met Val 
545 550 555 560 

i 

get ccg ggg aaa ggg ctg cat gaa atg gtg ggg gtg aaa cct tga 1725 
Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro 
565 570 



<210> 32 
<211> 574 
<212> PRT 

<213> Bacillus subtilis 
<400> 32 

Met Gly Thr Asn Val Gin Val Asp Ser Ala Ser Ala Glu Cys Thr Gin 
1 5 10 15 

Thr Met Ser Gly Ala Leu Met Leu He Glu Ser Leu Lys Lys Glu Lys ' 
20 25 30 

Val Glu Met He Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro He Tyr 
35 40 45 

Asp Lys Leu Tyr Asn Ser Gly Leu Val His He Leu Pro Arg His Glu 
50 55 60 

Gin Gly Ala He His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys 
65 70 75 80 

Pro Gly Val Val He Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 
85 90 95 

Thr Gly Leu Ala Asp Ala Met He Asp Ser Leu Pro Leu Val Val Phe 
100 105 110 

Thr Gly Gin Val Ala Thr Ser Val He Gly Ser Asp Ala Phe Gin Glu 
115 120 125 

Ala Asp lie Leu Gly lie Thr Met Pro Val Thr Lys His Ser Tyr Gin 



m 
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130 135 140 

Val Arg Gin Pro Glu Asp Leu Pro Arg He He Lys Glu Ala Phe His 
14 5 150 155 160 

i 

He Ala Thr Thr Gly Arg Pro Gly Pro Val Leu He Asp He Pro Lys 
165 ' 170 175 

Asp Val Ala Thr He Glu Gly Glu Phe Ser Tyr Asp His -Glu Met Asn 
180 185 190 

Leu Pro Gly Tyr Gin Pro Thr Thr Glu Pro Asn Tyr Leu Gin lie Arg 
195 200 205 

Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val He -Leu Ala 
210 215 220 

Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr 
225 230 235 240 

Ala Glu Gin Gin Gin He Pro Val Ala His Thr Leu Leu Gly -Leu Gly 
245 250 255 

Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Met Ala Gly Met His 
260 265 ' 270 

Gly Thr Tyr Thr Ala Asn Met Ala 1 Leu His Glu Cys Asp -Leu Leu lie 
. 275 280 285 

Ser He Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys -His 
290 295 300 

Phe Ala Arg Asn Ala Lys He Ala His He Asp lie Asp Pro Ala Glu 
305 310 315 320 

He Gly Lys He Met Lys Thr Gin He Pro Val Val Gly Asp Ser Lys 
325 330 335 

He Val Leu Gin Glu Leu He Lys Gin Asp Gly Lys GLn Ser Asp "Ser 
340 345 350 

Ser Glu Trp Lys Lys Gin Leu Ala Glu Txp Lys Glu Glu Tyr Pro Leu 
355 360 365 

Trp Tyr Val Asp Asn Glu Glu Glu Gly Phe Lys Pro Gin Lys Leu lie 
370 * 375 380 

Glu Tyr He His Gin Phe Thr Lys Gly Glu Ala He Val Ala Thr Asp 
385 390 395 400 

Val Gly Gin His Gin Met Trp Ser Ala Gin Phe Tyr Pro Phe Gin Lys 
405 410 415 

Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Met Gly Phe Gly 
420 425 430 

Leu Pro Ala Ala He Gly Ala Gin Leu Ala Glu Lys Asp Ala Thr Val 
435 440 445 
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Val Ala Val Val Gly Asp Gly Gly Phe Gin Met Thr Leu Gin Glu Leu 
450 455 460 

Asp Val lie Arg Glu Leu Asn Leu Pro Val Lys Val Val lie Leu Asn 
465 470 475 480 

Asn Ala Cys Leu Gly Met Val Arg Gin Trp Gin Glu He Phe Tyr Glu 
485 490 • 495 

Glu Arg Tyr Ser Glu Ser Lys Phe Ala Ser Gin Pro Asp Phe Val Lys 
500 505 5ia 

Leu Ser Glu Ala Tyr Gly lie Lys Gly He Arg lie Ser Ser Glu Ala 
515 520 525 

Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val 
530 535 $40 

Val He Asp Val Arg Val Ala Ser Glu Glu Lys Val Phe * Pro Met Val 
545 550 555 560 



Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro 
565 570 



<210> 33 
<211> 525 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (522) 

<400> 33 

ttg aaa aga att ate aca ttg act gtg gtg aac cgc tec ggg gtg tta 4 8 

Met Lys Arg He He Thr Leu Thr Val Val Asn Arg Ser Gly Val Leu 

15 10 15 

aac egg ate ace ggt eta ttc aca aaa agg cat tac aac att gaa age 96 
Asn Arg He Thr Gly Leu Phe Thr Lys Arg His Tyr Asn He Glu Ser 
20 25 30 

att aca gtt gga cac aca gaa aca gee ggc gtt tec aga ate ace ttc 14 4 
He Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg He Thr Phe 
35 40 45 

gtc gtt cat gtt gaa ggt gaa aat gat gtt gaa cag tta acg aaa cag 192 
Val Val His Val Glu Gly Glu Asn Asp Val Glu Gin Leu Thr Lys Gin 
50 55 60 

etc aac aaa cag att gat gtg ctg aaa gtc aca gac ate aca aat caa 240 
Leu Asn Lys Gin He Asp Val Leu Lys Val Thr Asp He Thr Asn Gin 
65 70 75 80 



teg att gtc cag agg gag ctg gee tta ate aag gtt gtc tec gca cct 288 
Ser He Val Gin Arg Glu Leu Ala Leu He Lys Val Val Ser Ala Pro 



• - t 
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85 90 95 

tea aca aga aca gag att aat gga ate ata ^aa ccg ttt aga gec tct 336 
Ser Thr Arg Thr Glu lie Asn Gly He He -Glu Pro Phe Arg Ala Ser 
100 105 110 

gtc gtt gat gtc age aga gac age ate gtt gtt ea<g gto. aea ggt gaa 384 
Val Val Asp Val Ser Arg Asp Ser lie Val Val Gin Val Thr ,Gly Glu 
115 120 125 

tct aac aaa att gaa -gcg ctt att gag tta tta aaa ect tat ggc att 432 
Ser Asn Lys He -Glu Ala Leu He *Glu Leu Leu Lys Pro Tyr Gly He 
130 135 140 

aaa gaa ate gcg aga aca ^gt aca aeg get ttt gcg agg gga aec age 480 
Lys Glu He Ala Arg Thr -Gly Thr Thr Ala Phe Ala Arg Gly Thr Ser 
145 150 155. 1*0 



aaa agg cgt cat oca ata aaa caa tat eta ttc, tat aaa aca taa 
Lys Arg Arg His Pro He Lys Gin Tyr Leu Leu Tyr Lys Thr 
165 170 



525 



<210> 34 
<211> 174 
<212> PRT 

<213> Bacillus subtilis 4 • 1 

<400> 34 

Met Lys Arg He He Thr Leu Thr Val Val Asn Arg Ser -Gly Val Leu 
1-5 10 15 

Asn Arg He Thr Gly Leu Phe Thr Lys Arg His Tyr Asn He Glu Ser 
20 25 30 

He Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg He Thr Phe 
35 40 45 

Val Val His Val Glu Gly -Glu Asn Asp Val <51u Gin Leu Thr Lys Gl*i 
50 55 60 

Leu Asn Lys Gin He Asp Val Leu Lys Val Thr Asp lie Thr Asn Gin 
65 70 75 80 

Ser He Val Gin Arg Glu Leu Ala Leu He Lys Val Val Ser Ala Pro 
85 90 95 

Ser Thr Arg Thr Glu He Asn Gly He He Glu Pro Phe Arg Ala Ser 
100 ios; 110 

Val Val Asp Val Ser Arg Asp Ser He Val Val -Gin Val Thr Gly Glu 
115 120 125 

Ser Asn Lys He <Slu Ala Leu He Glu Leu Leu Lys Pro Tyr -Gly He 
130 135 140 

Lys Glu He Ala Arg Thr -Gly Thr Thr Ala Phe Ala Arg Gly Thr Ser 
145 150 155 160 
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Lys Arg Arg His Pro lie Lys Gin Tyr Leu Leu Tyr Lys Thr 
165. 170 



<210> 35 
<211> 1029 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (1026) 

<400> 35 

atg gta aaa gta tat tat aac ggt gat ate aaa gag aac gta ttg get 48 

Met Val Lys Val Tyr Tyr Asn Gly Asp He Lys Glu Asn Val Leu Ala 
1 5 10 15 

gga aaa aca gta gcg gtt ate ggg tac ggt ccg caa ggc' cac gca cat 96 
Gly Lys Thr Val Ala Val He Gly Tyr Gly Ser Gin Gly His Ala His 
20 25 30 

gee ctg aac ctt aaa gaa age gga gta gac gtg ate gtc ggt gtt aga 144 
Ala Leu Asn Leu Lys Glu Ser Gly Val Asp Val He Val Gly Val Arg 
35 40 45 

caa gga aaa tct ttc act caa gec caa gaa gac gga cat aaa gta ttt 192 
Gin Gly Lys Ser Phe Thr" Gin Ala Gin Glu Asp Gly His Lys Val Phe 
50 55 60 

tea gta aaa gaa gcg gca gec caa gec gaa ate ate atg gtt ctg ctt 240 
Ser Val Lys Glu Ala Ala Ala Gin Ala Glu He He Met Val Leu Leu 
65 70 75 80 

ccg gat gag cag cag caa aaa gta tac gaa get gaa ate aaa gat gaa 288 
Pro Asp Glu Gin Gin Gin Lys Val Tyr Glu Ala Glu He Lys Asp Glu 
85 90 95 

ttg aca gca gga aaa tea tta gta ttc get cat gga ttt aac gtg cat 336 
Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His 
100 105 HO 

ttc cat caa att gtt cct ccg gcg gat gta gat gta ttc tta gtg gec 384 
Phe His Gin He Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala 
115 120 125 

cct aaa ggc ccg gga cac ttg gta aga aga aca tat gag caa gga get 4 32 
Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr Tyr Glu Gin Gly Ala 
130 135 140 

ggc gta cct gca ttg ttc gca ate tat caa gat gtg act gga gaa gca 480 
Gly Val Pro Ala Leu Phe Ala He Tyr Gin Asp Val Thr Gly Glu Ala 
145 150 155 160 

aga gac aaa gee etc get tat get aaa gga ate ggc ggc gca aga gcg 528 
Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly He Gly Gly Ala Arg Ala 
165 170 ' 175 



WO 01/21772 



PCT/USOO/25993 



42- 



ggc gta tta gaa acg aca ttt aaa gaa gaa aca gaa aca gat ttg ttc S7S 
Gly Val Leu Glu Thr Thr Phe Lys Glu Glu Thr Glu Thr Asp Leu Phe 
180 185 190 

ggt gag caa gca gtt ctt tgc g,gc gga tta age gcg ctt gtc aaa gec 624 
Gly Glu Gin Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala 
195 200 205 



gga ttt gaa acc tta act gaa gca ggt tat cag cct gaa ctt gca tac 
Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr -Gin Pro Glu Leu Ala Tyr 
210 215 220 

ttc gag tgt ctt cat gag ctg aaa tta ate gta gac ctt atg tac gaa 
Phe Glu Cys Leu His Glu Leu Lys Leu lie Val Asp Leu Met Tyr -Glu 
225 ' 230 235 240 

gaa gga ctt gca gga atg aga tat tea ate tct gac aca, gca cag tgg 
Glu Gly Leu Ala Gly Met Arg Tyr Ser He Ser Asp Thr Ala Gin Trp 
245 250 255 

gga gat ttc gta tea ggc ect cgc gtt gtg gac gec aaa gta aaa gaa 
Gly Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu 
260 265 270 

tct atg aaa gaa gta tta aaa gat ate caa aac ggt aca ttc gca aaa 
Ser Met Lys Glu Val Leu Lys Asp He -Gin Asn Gly Thr Phe Ala Lys 
275 280 285 

gag tgg ate gtc gaa aac caa gta aac cgt cct cgt ttc aac get ate 
Glu Trp He Val Glu Asn Gin Val Asn Arg Pro Arg Phe Asn Ala He 
290 295 300 



cgt gaa atg atg ccg ttt gtg aaa caa ggc aag aag aag gaa gcg gtg 
Arg Glu Met Met Pro Phe Val Lys Glri Gly Lys Lys Lys Glu Ala Val 
325 330 335 

gtc tec gtt gcg caa aat taa 
Val Ser Val Ala Gin Asn 
340 



<210> 36 
<211> 342 
<212> PRT 

<213> Bacillus subtilis 
<400> 36 

Met Val Lys Val Tyr Tyr Asn Gly Asp He Lys Glu Asn Val Leu Ala 
1^5 10 15 

Gly Lys Thr Val Ala Val He Gly Tyr Gly Ser Gin Gly His Ala His 
20 25 30 



672 



720 



7-68 



816 



864 



912 



aat gca age gag aac gaa cat caa ate gaa gta gtg gga aga aag ctt 960 
Asn Ala Ser Glu Asn Glu His -Gin He Glu Val Val Gly Arg Lys Leu 
305 310 315 320 



1008 



1029 
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Ala Leu Asn Leu Lys Glu Ser Gly Val Asp Val lie Val Gly Val Arg 
35 40 4 5 

Gin Gly Lys Ser Phe Thr Gin Ala Gin Glu Asp Gly His Lys Val Phe 
50 55 60 

'ser Val Lys Glu Ala Ala Ala Gin Ala Glu He He Met Val Leu Leu 
65 70 75 80 

Pro Asp Glu Gin Gin Gin Lys Val Tyr Glu Ala Glu He Lys Asp Glu 
85 90 95 

Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His 
100 105 no 

Phe His Gin He Val Pro Pro Ala Asp Val Asp Val Phe Leu Val AJ a 
115 120 125 

Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr Tyr Glu Gin Gly Ala 
130 135 140 

Gly Val Pro Ala Leu Phe Ala He Tyr Gin Asp Val Thr Gly Glu Ala 
,145 150 155 160 

Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly He Gly Gly Ala Arg Ala 
165 ^ 170 175 

Gly Val Leu Glu Thr Thr Phe Lys' Glu Glu Thr Glu Thr Asp Leu Phe 
180 185 190 

Gly Glu Gin Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala 
• 195 200 205 

Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Gin Pro Glu Leu Ala Tyr 
210 215 220 

Phe Glu Cys Leu His Glu Leu Lys Leu He Val Asp Leu Met Tyr Glu 
225 230 235 240 

Glu Gly Leu Ala Gly Met Arg Tyr Ser He Ser Asp Thr Ala Gin Trp 
245 250 255 

Gly Asp Phe Val Ser <Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu 
260 265 270 

Ser Met Lys Glu Val Leu Lys Asp He Gin Asn Gly Thr Phe Ala Lys 
275 280 285 

Glu Trp He Val Glu Asn Gin Val Asn Arg Pro Arg Phe Asn Ala He 
290 295 3.00 

Asn Ala Ser Glu Asn Glu His Gin He Glu Val Val Gly Arg Lys Leu 
305 310 315 320 

Arg Glu Met Met Pro Phe Val Lys Gin Gly Lys Lys Lys Glu Ala Val 
325 330 * " 335 



Val Ser Val Ala Gin Asn 
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340 



<210> 37 
<211> 1674 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..(1674) 

<400> 37 

atg gca gaa tta cgc agt aat atg ate aca raa gga ate gat aga get 43 

Met Ala Glu Leu Arg Ser Asn Met lie Thr Gin Gly lie Asp Arg Ala 
1 5. 10 IS 

ccg cac cgc agt ttg ctt cgt gca gca ggg gta aaa gaa ,gag gat ttc 96 
Pro His Arg Ser Leu Leu Arg Ala Ala Gly Val Lys Glu "Glu Asp Phe 
20 25 M 30 

ggc' aag ccg ttt att gcg gtg tgt aat tea tac att gat ate gtt ccc 144 
Gly Lys Pro Phe lie Ala Val Cys Asn Ser Tyr lie Asp lie Val Pro 
35 40 45 

ggt cat gtt cac ttg cag gag ttt ggg aaa ate gta aaa gaa gca ate 192 
Gly His Val His Leu Gin <31u Phe Gly Lys He Val Lys Clu Ala He 
50 55 60 

aga gaa gca ggg ggc gtt ccg ttt gaa ttt aat ace att ggg gta gat 240 
Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr He Gly Val Asp 
65 70 75 B0 ' 

gat ggc arc gca atg ggg cat ate ggt atg aga tat teg ctg oca age 288 
Asp Gly He Ala Met Gly His He Gly Met Arg Tyr Ser Leu Pro Ser 
85 90 95 

cgt gaa att ate gca gac tct gtg gaa acg gtt gta tec gca cac tgg 336 
Arg Glu He lie Ala Asp ~Ser Val Glu* Thr Val Val Ser Ala His Trp 
100 105 110 

ttt gac gga atg gtc tgt att -ccg aac tgc gac aaa ate aca ccg -gga 384 
Phe Asp Gly Met Val Cys lie Pro Asn Cys Asp Lys lie Thr Pro Gly 
115 120 125 

atg ctt atg gcg gca atg cgc ate aac att ccg acg att ttt gtc age 4 32 
Met Leu Met Ala Ala Met Arg lie Asn lie Pro Thr lie Phe Val Ser 
130 135 140 

ggc gga ccg atg gcg gca gga aga aca agt tac ggg cga aaa ate tec 4 80 
Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr <51y Arg Lys lie Ser 
145 150 155 160 

ctt tec. tea gta ttc gaa ggg gta ggc gee tac caa gca ggg aaa ate 528 
Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr -Gin Ala Gly Lys lie 
165 170 175 

aac gaa aac gag ctt caa gaa eta gag cag ttc gga tgc cca acg tgc 57 6 
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Asn Glu Asn Glu Leu Gin Glu Leu Glu <51n Phe Gly Cys Pro Thr Cys 
180 185 190 

ggg tct tgc tea ggc atg ttt acg gcg aac tea atg aac tgt ctg tea 624 

Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Ser 
, 195 200 205 

gaa gca ctt ggt ctt get ttg ccg ggt aat gga ace att ctg gca aca 672 

Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr He Leu ,Ala Thr 
210 215 220 

tct ccg gaa cgc aaa gag ttt gtg aga aaa teg get gcg caa tta atg 720 
Ser Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gin Leu Met 
225 230 235 240 

gaa acg att cgc aaa gat ate aaa ccg cgt gat att gtt aca gta aaa 768 
Glu Thr He Arg Lys Asp He Lys Pro Arg Asp He Val Thr Val Lys 
245 250 255 

gcg att gat aac gcg ttt gca etc gat atg gcg etc gga ggt tct aca 816 
Ala He Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr 
260 265 > 270 

aat ace gtt ctt cat ace ctt gee ctt gca aac gaa gee ggc gtt gaa 864 
Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu 
275 280 285 

tac tct tta gaa cgc att aac gaa gtc get gag cgc gtg ccg cac ttg 912 
Tyr Ser Leu Glu Arg He Asn Glu Val Ala Glu Arg Val Pro His Leu 
290 295 300 

get aag ctg gcg cct gca teg gat gtg ttt att gaa gat ctt cac gaa 960 
Ala Lys Leu Ala Pro Ala Ser Asp Val Phe He Glu Asp Leu His Glu 
305 310 315 320 

gcg ggc ggc gtt tea gcg get ctg aat gag ctt teg aag aaa gaa gga 1008 
Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys <Slu Gly 
325 330 335 

gcg ctt cat tta gat gcg ctg act gtt aca gga aaa act ctt gga gaa 1056 
Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu 
340 345 350 

ace att gee gga cat gaa gta aag gat tat gac gtc att cac .ccg ctg 1104 
Thr He Ala Gly His Glu Val Lys Asp Tyr Asp Val He His Pro Leu 
355 360 365 

gat caa cca ttc act gaa aag gga ggc ctt get gtt tta ttc <jgt aat 1152 
Asp Gin Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn 
370 375 380 

eta get ccg gac ggc get ate att aaa aca ggc ggc gta cag aat ggg 1200 
Leu Ala Pro Asp Gly Ala lie He Lys Thr Gly Gly Val Gin Asn Gly 
385 390 395 400 

att aca aga cac gaa ggg ccg get gtc gta ttc gat tct cag gac gag 1248 
He Thr Arg His Glu Gly Pro Ala Val Val Phe Asp Ser Gin Asp Glu 
405 410 415 
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gcg ctt gac ggc att ate aac cga aaa gta aaa gaa ggc gac gtt gtc 1296 
Ala Leu Asp Gly He He Asn Arg Lys Val Lys *Glu Gly Asp Val Val 
420 42S 430 

» 

ate ate aga tac gaa ggg eca aaa ggc gga cct ggc atg ecg gaa atg 134 4 
He He Arg Tyr Glu Gly Pro Lys -Gly Gly Pro Gly Met Pro Glu Met 
435 440 445 

ctg gcg cca aca tec caa ate gtt gga atg gga etc ggg cca aaa gtg 1392 
Leu Ala Pro Thr Ser Gin lie Val Gly Met Gly Leu Gly Pro Lys Val 
450 455 460 

gca ttg att acg gac gga cgt ttt tec gga gec *cc egt ggc etc tea 14 40 
Ala Leu He Thr Asp Gly Arg Phe Ser Gly Ala Ser Arg Gly Leu Ser 
465 470 475 480 

ate ggc cac gta tea cct gag gec get gag -ggc ggg ceg»ett gee ttt 1488 
He Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly Pro Leu Ala Phe 
485 4 90 4 95 

gtt gaa aac gga gac cat att ate gtt gat att gaa aaa egc ate ttg 1536 
Val Glu Asn Gly Asp His He He Val Asp He Glu Lys Arg He Leu 
500 505 510 

gat gta caa gtg cca gaa gaa gag tgg gaa aaa cga aaa gcg aac tgg 1584 
Asp Val Gin- Val Pro Glu Giu Glu Trp Glu Lys Arg Lys Ala Asn Trp 
515 520 525 

aaa ggt ttt gaa ccg aaa gtg aaa ace gge tac ctg gca egt tat tet 1632 
Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser 
530 535 540 

aaa ctt gtg aca agt gec aac acc ggc ggt att atg aaa ate 1674 
Lys Leu Val Thr Ser Ala Asn Thr Gly Gly He Met Lys He 
545 550 555 



<210> 38 
<211> 558 
<212> PRT 

<213> Bacillus subtilis 
<400> 38 

Met Ala Glu Leu Arg Ser Asn Met He Thr Gin Gly He Asp Arg Ala 
15 10 15 

Pro His Arg Ser Leu Leu Arg Ala Ala Gly Val Lys Glu Glu Asp Phe 
20 25 30 

Gly Lys Pro Phe He Ala Val Cys Asn Ser Tyr lie Asp He Val Pro 
35 40 45 

Gly His Val His Leu Gin Glu Phe Gly Lys He Val Lys Glu Ala He 
50 55 60 

Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr He Gly Val Asp 
65 70 75 80 



1 



WO 01/21772 PCT/US00/25993 

-47 - 



Asp Gly He Ala Met Gly His He Gly Met Arg Tyr Ser Leu Pro Ser 
85 90 1 95 

Arg Glu He He Ala Asp Ser Val Glu Thr Val Val Ser Ala His Trp 
100 105 110 

Phe Asp Gly Met Val Cys He Pro Asn Cys Asp Lys He Thr Pro Gly 
115 120 125 

Met Leu Met Ala Ala Met Arg He Asn He Pro Thr He Phe Val Ser 
130 135 140 

Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr Gly Arg Lys He Ser 
145 150 155 " ' 160 

Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr Gin Ala Gly Lys He 
165 170 ^ 175 

Asn Glu Asn Glu Leu Gin Glu Leu Glu Gin Phe Gly Cys Pro Thr Cys 
180 185 190 

i 

, Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Ser 
195 200 205 

Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr He Leu Ala Thr 
210 215 220 

Ser Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gin Leu Met 
225 230 235 240 

Glu Thr He Arg Lys Asp He Lys Pro Arg Asp lie Val Thr Val Lys 
245 250 255 

Ala He Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr 
260 265 270 

Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu 
275 280 285 

Tyr Ser Leu Glu Arg He Asn Glu Val Ala Glu Arg Val Pro His Leu 
290 295 300 

Ala Lys Leu Ala Pro Ala Ser Asp Val Phe He Glu Asp Leu His Glu 
305 310 315 320 

Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly 
325 330 ' 335 

Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu 
340 345 350 

Thr lie Ala Gly His Glu Val Lys Asp Tyr Asp Val He His Pro Leu 
355 360 365 

Asp Gin Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn 
370 375 380 
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Leu Ala Pro Asp Gly Ala lie II* Lys Thr Gly Gly Val Gin Asn Gly 
385 390 • 395 400 

He Thr Arg His Glu Gly Pro Ala Val Val Phe A^p Ser Gin Asp Glu 
405 410 415 

Ala Leu Asp Gly He He Asn Arg Lys Val Lys *Glu Gly Asp Val Val 
4 20 425 430 

He He Arg Tyr Glu Gly Pro Lys Gly Gly Pro Gly Met Pro Glu Met 
435 440 445 

Leu Ala Pro Thr Ser Gin He Val Gly Met Gly Leu Gly Pro Lys Val 
450 455 460 

Ala Leu He Thr Asp Gly Arg Phe Ser Gly Ala Ser Arg Gly Leu Ser 
4 65 470 475 480 

♦ 

He Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly Pro Leu Ala Phe 
485 490 495 

Val Glu Asn Gly Asp His He He Val Asp lie Glu Lys Arg He Leu 
500 505 510 

Asp Val Gin Val Pro Glu Glu Glu Trp Glu Lys Arg Lys Ala Asn Trp 
515 520 525 

Lys Gly Phe Glu Pro Lys Val Lys Thr -Gly Tyr Leu Ala Arg Tyr Ser 
530 535 '540 

Lys Leu Val Thr Ser Ala Asn Thr Gly Gly He Wet Lys He 
545 550 555 



<210> 39 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (136) . . (141) 

<220> 

<221> -10_signal 
<222> (159) . . (164) 

<400> 39 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 

tatttctccc ttgaggggta caaagaggtg t<rcctagaag agat<:cacgc tgtgtaaaaa 120 

ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taat«ttaatt acaggcgggg 180 
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gcaaccccgc ctgt 194 



<210> 40 
<211> 163 
<212> DNA 

'<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (113) . . (118) 

<220> 

<221> -10_signal 
<222> (136) . . (141) 

<400> 40 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 
tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 
atctacaagg tgtggtataa taatcttaac aacagcagga cgc 163 



<2I0> 41 
<211> 127 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (34) . . (39) 

<220> 

<221> -10_signal 
<222> (58) . . (63) 

<220> 

<221> -35_signal 
<222> (75) . . (80) 

<220> 

<221> -10_signal 
<222> (98) . . (103) 

<400> 41 

gaggaatcat agaattttgt caaaataatt ttattgacaa cgtcttatta acgttgatat 60 
aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120 
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tgcaatg 127 

<210> 42 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 42 

taaacatgag gaggagaaaa catg 24 

<210> 43 
<211> 28 

<212> DNA ♦ 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 43 

attcgagaaa tggagagaat ataatatg 28 

<210> 44 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<22C> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 44 

agaaaggagg tga 13 

<210> 45 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<400> 45 

ttaagaaagg aggtgannnn atg 23 

<210> 46 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
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binding site 



<400> 46 

ttagaaagga ggtgannnnn atg 



23 



<210> 47 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribo some 
binding site 

<400> 47 

agaaaggagg tgannnnnnn atg 23 

<210> 48 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 



<210> 49 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 49 

ccctctagaa ggaggagaaa acatg 25 

<210> 50 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 



<400> 48 

agaaaggagg tgannnnnna tg 



22 



<400> 50 

ccctctagag gaggagaaaa catg 



24 



<210> 51 
<211> 23 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribo some 
( binding site 

<400> 51 

ttagaaagga ggatttaaat atg ■ 23 

<210> 52 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 52 

ttagaaagga ggtttaatta atg 23 

i 

, <210> 53 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> ' ' ' , ' 

<223> • Description of Artificial Sequence : ribo some 
binding site 

<400> 53 

ttagaaagga ggtgatttaa atg -23 ' 

<210> 54 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<400> 54 

ttagaaagga ggtgtttaaa atg -23 

<210> 55 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial "Sequence: ribosome 
binding site 

<400> 55 

attcgagaaa ggaggtgaat ataatatg 28 
<210> 56 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description' of Artificial Sequence : ribosome 
binding site 

<400> 56 

attcgagaaa ggaggtgaat aataatg 27 

<210> 57 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

♦ 

<400> 57 

attcgtagaa aggaggtgaa ttaatatg 28 



<210> 58 
<211> 3291 
<212> DNA 

<213> Bacillus subtilis 
<400> 58 

atggggacta atgtacaggt ggattcagca 

gcattaatgc tgattgaatc attaaaaaaa 

ggcggggctg tgcttccgat ttacgataag 

ccccgtcacg aacaaggagc aattcatgca 

ccgggtgtcg tcattgccac gtcagggccg 

gatgccatga ttgattcatt gccgttagtc 

atcgggagcg atgcatttca ggaagcagac 

cacagctacc aggttcgcca gccggaagat 

attgcaacaa ctggaagacc cggacctgta 

attgaaggag aattcagcta cgatcatgag 

gagccgaatt atttgcagat ccgcaagctt 

gtgatcctgg cgggtgcggg cgtactgcac 

gctgaacagc agcaaatccc tgtggcacac 

gaccatccgc ttttcctagg gatggcggga 



tctgccgaat 


gtacacagac 


gatgagcgga 


60 


gagaaagtag 


aaatgatctt 


cggttatccg 


120 


ctatacaatt 


cagggttggt 


acatatcctt 


180 


gcggagggat 


acgcaagggt 


ctccggaaaa 


240 


ggagcgacaa 


accttgttac 


aggccttgct 


300 


gtctttacag 


ggcaggtagc 


aacctctgta 


360 


attttaggga 


ttacgatgcc 


agtaacaaaa 


420 


ctgccgcgca 


tcattaaaga 


agcgttccat 


480 


ttgattgata 


ttccgaaaga 


tgtagcaaca 


540 


atgaatctcc 


cgggatacca 


gccgacaaca 


600 


gtggaagccg 


tgagcagtgc 


gaaaaaaccg 


660 


ggaaaagcgt 


cagaagaatt 


aaaaaattat 


720 


acccttttgg 


ggctcggagg 


cttcccggct 


780 


atgcacggta 


cttatacagc 


caatatggcc 


840 
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cttcatgaat gtgatctatt aatcagtatc ggcgcccgtt ttgatgaccg tgfccacagga 900 
aacctgaaac actttgccag aaacgcaaag atagcccaca bcgatat<tga tccagctgaa 960 
atcggaaaaa tcatgaaaac acagattcct gtagtcggag acagcaaaat ^tcctgcag 1020 
gagctgatca aacaagacgg caaacaaagc gattcaagcg aatggaaaaa acagctcgca 1080 
gaatggaaag aagagtatcc gctctggtat gtagataatg aagaagaagg ^tttaaacct 1140 
cagaaattga ttgaatatat t-catcaattt acaaaaggag aggccattgt cgcaacggat 1200 
gtaggccagc atcaaatgtg gtcagcgcaa ttttatccgt tccaaaaagc agataaatgg 1260 
gtcacgtcag gcggacttgg aacgatggga ttcggtcttc trggcggcgat *cggcgcacag 1320 
ctggccgaaa aagatgctac tgttgtcgcg gttgtcggag acggcggatt ccaaatgacg 1380 
cttcaagaac tcgatgttat tcgcgaatta aatcttccgg tcaaggtagt gattttaaat 1440 
aacgcttgtc tcggaatggt cagacagtgg caggaaattt tctatgaaga acgttatt<:a 1500 
gaatctaaat tcgcttctca gcctgacttc gtcaaattgt ccgaagcata oggcattaaa 15*60 
ggcatcagaa tttcatcaga agcggaagca aaggaaaagc *tggaagaggc attaacatca 1-620 
agagaacctg ttgtcattga cgtgcgggtt gccagcgaag aaaaagtatt cccgatggtg 1680 
gctccgggga aagggctgca tgaaatggtg ggggtgaaac cttgaaaaga at t a tea cat 17 40 
tgactgtggt gaaccgctcc ggggtgttaa accggatcac eggtctatte acaaaaaggc 1600 
attacaacat tgaaagcatt acagttggac acacagaaac ageeggegtt tccagaat<:a 1860 
ccttcgtcgt tcatgttgaa ggtgaaaatg atgttgaaca gttaacgaaa -cagctcaaca 1920 
aacagattga tgtgctgaaa gtcacagaca tcacaaatca atcgattgtc cagagggagc 1980 
tggccttaat caaggttgtc tccgcacctt caacaagaac agagattaat ggaatcatag 2O40 
aacegtttag agcctctgtc gttgatgtca gcagagacag catcgttgtt: x:aggtgacag 2100 
gtgaatctaa caaaattgaa -gcgcttattg agttattaaa accttatggc attaaagaaa 2160 
tegegagaac aggtacaacg gettttgega ggggaaccag ^caaaaggegt catccaataa 2220 
aacaatatct attgtataaa acataacaag ggagagattg aaatggtaaa agtatattat 2280 
aacggtgata tcaaagagaa cgtattggct ggaaaaacag tageggttat -egggtaeggt 2340 
tcgcaaggcc acgcacatgc cctgaacctt aaagaaagcg gagtagacgt gategtoggt 2400 
gttagacaag gaaaatcttt cactcaagcc x:aagaagacg gacataaagt attttcagta 2460 
aaagaagegg cagcccaagc cgaaatcatc atggttctgc ttccggatga gcagcagcaa 2520 
aaagtatacg aagctgaaat caaagatgaa ttgacagcag gaaaat<;att agtatfccgct 2580 
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catggattta 


acgtgcattt 


ccatcaaatt 


gttcctccgg cggatgtaga 


tgtattctta 


2640 


gtggccccta 


aaggcccggg 


acacttggta 


agaagaacat atgagcaagg agctggcgta 


2700 


cctgcattgt 


tcgcaatcta tcaagatgtg 


actggagaag caagagacaa 


agccctcgct 


27,60 


tatgctaaag 


gaatcggcgg 


cgcaagagcg 


ggcgtattag aaacgacatt 


taaagaagaa 


2820 


acagaaacag 


atttgttcgg 


tgagcaagca 


gttctttgcg gcggattaag 


cgcgcttgtc 


2880 


aaagccggat 


ttgaaacctt 


aactgaagca 


ggttatcagc ctgaacttgc 


atacttcgag 


2940 


tgtcttcatg 


agctgaaatt 


aatcgtagac 


cttatgtacg aagaaggact 


tgcaggaatg 


3000 


agatattcaa 


tctctgacac 


agcacagtgg 


ggagatttcg tatcaggccc tcgcgttgtg 


3060 


gacgccaaag 


taaaagaatc 


tatgaaagaa 


gtattaaaag atatccaaaa 


cggtacattc 


3120 


gcaaaagagt 


ggatcgtcga 


aaaccaagta 


aaccgtcctc gtttcaaegc 


tatcaatgca 


3180 


agcgagaacg 


aacatcaaat 


cgaagtagtg 


ggaagaaagc ttcgtgaaat 


gatgccgttt 


3240 


gtgaaacaag 


gcaagaagaa 


ggaagcggtg 


gtctccgttg cgcaaaatta 


a 


3291 



<210> 59 
<211> 2363 
<212> DNA 

<213> Bacillus subtilis 

<220> 
<221> CDS 

<222> (242) (1072) 

<220> • 
<221> CDS 

<222> (1077) (1934) 

<220> 
<221> CDS 

<222> (1939) . . (2319) 
<400> 59 

ttggtacaag cccgttgatt ttggtatact tccattgggc agtatcgcct gcgaactgca 60 

cctattatta aaatagatag acattgcagc agtctgcctt gatccaaaaa aggactggga 120 

cagagggatg aaactcgccg aactttagaa agtgaagaat ccttctcgtt gtaacggaag 180 

gttttttggc ttgcagaaga aaacggcaga tcatctcctc taaacatgag gaggagaaaa 240 

c atg aaa aca aaa ctg gat ttt eta aaa atg aag gag tct gaa gaa ccg 289 
Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 
15 10 15 

att gtc atg ctg acc get tat gat tat ccg gca get aaa ctt get gaa 337 
He Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 
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20 25 30 

caa gcg gga gtt gac atg att tta gt<: ggt gat tea ctt gga atg gtc 385 
Gin Ala Gly Val Asp Met He Leu Val Gly Asp Ser Leu Gly -Met Val 
35 40 45 

gtc etc ggc ctt gat tea act gtc ggt gtg aca gtt gcg gac atg ate 433 
Val Leu Gly Leu Asp Ser Thr Val <31y Val Thr Val Ala Asp Met He 
50 55 '60 

cat cat aca aaa gec gtt aaa agg ggt gcg ccg aat ace ttt att gtg 4-81 
His His Thr Lys Ala Val Lys Arg Cly Ala -Pare Asn Thr Phe He Val 
65 ' 70 7S 80 

i 

aca gat atg ccg ttt atg tct tat cac ctg tct aag gaa ^at acq ctg 529 
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys <31u Asp Thr Leu 
85 90 95 

aaa aat gca gcg get ate gtt cag gaa age gga get -gac gea ctg aag 577 

Lys Asn Ala Ala Ala lie Val -Gin Glu Ser Gly Ala Asp Ala Leu Lys 

100 105 ' H-0 

i 

ctt gag ggc gga gaa ggc gtg ttt gaa tec att cgc gca ttg acg ctt 625 

Leu Glu Gly Gly Glu Gly Val Phe -Glu Ser He Arg Ala Leu Thr Leu 
115 120 125 

gga ggc att cca gta gtc agt cac 'tta ggt ttg aca ccg -cag tea gtc 673 
Gly Gly He Pro Val Val "Ser His Leu -Gly Leu Thr Pro <Un Ser Val 
130 135 140 

ggc gta ctg ggc ggc tat aaa gta ca<g ggc aaa -gac gaa caa age gec 721 
Gly Val Leu Gly Gly Tyr Lys Val ^Gln XSly Lys Asp *Glu Gin Ser Ala 
145 150 155 160 

aaa aaa tta ata gaa gac agt ata aaa tgc gaa gaa -gca gga get atg 
Lys Lys Leu He Glu Asp Ser He Lys -Cys Clu *51u Ala Gly Ala Met 
165 170 175 

atg ctt gtg ctg gaa tgt gtg ocg gca gaa etc aca gec aaa att gec 
Met Leu Val Leu Glu Cys Val Pro Ala -Glu Leu Thr Ala Lys He Ala 
180 185 190 

gag acg eta age ata ccg gtc att gga ate ggg got ggt gtg aaa gc<g 665 
Glu Thr Leu Ser He Pro Val He "Gly lie -Gly Ala Gly Val Lys Ala 
195 200 205 

gac gga caa gtt etc gtt tat cat gat att ate ggc cac -ggt gtt gag 913 
Asp Gly Gin Val Leu Val Tyr His Asp He He Gly His -Gly Val -Glu 
210 215 220 



aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa ace ate 
Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg He Asp Glu Thr lie 
225 230 235 240 

gaa aca gca ate age gga tat gtt cag gat gta aga cat -cgt -get ttc 
Glu Thr Ala He Ser <31y Tyr Val Gin Asp Val Arg His Arg Ala tfhe 
24 5 250 255 



769 



817 



^961 



1009 
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cct gaa caa aag cat tec ttt caa atg aac cag aca gtg ctt gac ggc , 1057 
Pro Glu Gin Lys His Ser Phe Gin Met Asn Gin Thr Val Leu Asp Gly 
260 ( 265 270 

ttg tac ggg gga aaa taag atg aga cag att act gat att tea cag ctg 1106 
Leu Tyr Gly Gly Lys Met Arg Gin He Thr Asp He Ser Gin Leu 

275 280 285 

aaa gaa gec ata aaa caa tac cat tea gag ggc aag tea ate gga ttt 1154 
Lys Glu Ala He Lys Gin Tyr His Ser Glu Gly Lys Ser He Gly Phe 
290 295 300 

gtt ccg acg atg ggg ttt ctg cat gag ggg cat tta acc tta gca gac 1202 
Val Pro Thr Met Gly Phe Leu His Glu Gly His Leu Thr Leu Ala Asp 
305 310 315 

aaa gca aga caa gaa aac gac gee gtt att atg agt att ttt gtg aat 1250 

Lys Ala Arg Gin Glu Asn Asp Ala Val He Met Ser He Phe Val Asn 
320 325 330 335 

i 

cct gca caa ttc ggc cct aat gaa gat ttt gaa gca tat ccg cgc gat 1298 

Pro Ala Gin Phe Gly Pro Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp 
340 345 350 

att gag egg gat gca get ctt gca gaa aac gec gga gtc gat att ctt 1346 
He Glu Arg Asp Ala Ala Leu Ala Glu Asn Ala Gly Val Asp He Leu 
355 360 365 

ttt acg cca gat get cat gat atg tat ccc ggt gaa aag aat gtc acg 1394 
Phe Thr Pro Asp Ala His Asp Met Tyr Pro Gly Glu Lys Asn Val Thr 
370 375 ~ 380 

att cat gta gaa aga cgc aca gac gtg tta tgc ggg cgc tea aga gaa 1442 
He His Val Glu Arg Arg Thr Asp Val Leu Cys Gly Arg Ser Arg Glu 
385 390 395 

gga cat ttt gac ggg gtc gcg ate gta ctg acg aag ctt ttc aat eta 14 90 
Gly His Phe Asp Gly Val Ala He Val Leu Thr Lys Leu Phe Asn Leu 
400 405 410 415 

gtc aag ccg act cgt gee tat ttc ggt tta aaa gat gcg cag cag gta 1538 
Val Lys Pro Thr Arg Ala Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val 
420 425 430 

get gtt gtt gat ggg tta ate age gac ttc ttc atg gat att gaa ttg 1586 
Ala Val Val Asp Gly Leu He Ser Asp Phe Phe Met Asp He Glu Leu 
435 440 445 

gtt cct gtc gat acg gte aga gag gaa gae ggc tta gee aaa age tct 1634 
Val Pro Val Asp Thr Val Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser 
450 455. 460 

cgc aat gta tac tta aca get gag gaa aga aaa gaa gcg cct aag ctg 1682 
Arg Asn Val Tyr Leu Thr Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu 
465 470 475 



tat egg gec ctt caa aca agt gcg gaa ctt gtc caa gee ggt gaa aga 
Tyr Arg Ala Leu Gin Thr Ser Ala Glu Leu Val Gin Ala Gly Glu Arg 



1730 
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480 485 490 49S 

gat cct gaa gcg gtg ata aaa get gca aaa gat ate att gaa acg act 1778 

Asp Pro Glu Ala Val He Lys Ala Ala Lys Asp lie He Glu Thr Thr 
500 505 510 

age gga ace ata gac tat gta gag ctt tat tee tat -ccg <gaa etc gag 1826 

Ser Gly Thr He Asp Tyr Val Glu Leu Tyr Ser Tyr Pro <31u,Leu -Glu 
515 520 525 



cct gtg aat gaa att get gga aag atg att etc get gtt gca gtt -get 
Pro Val Asn Glu He Ala Cly Lys Met lie Leu Ala Val Ala Val Ala 
530 535 540 

ttt tea aaa gcg cgt tta ata gat aat ate att att gat att ega gaa 
Phe Ser Lys Ala Arg Leu He Asp Asn He He He Asp lie Arg <31u 
545 550 555 

atg gag aga ata taat atg tat cga aca atg atg age ggc aaa ctt <cac 
Met Glu Arg He Met Tyr Arg Thr Met Wet Ser Gly Lys Leu His 

560 565 570 • 

i 

agg gca act gtt acg gaa gca aac ctg aac tat gtg gga age att aca 
Arg Ala Thr Val Thr -Glu Ala Asn Leu Asn Tyr Val Gly Ser He Thr 
575 580 "585 590 

ate gat gaa gat etc att gat get' gtg gga atg ctt ect aat gaa aaa 
He Asp Glu Asp Leu He Asp Ala Val Gly Met Leu Pro Asn Glu Lys 
595 600 €05 



gca cgc ctt gtg cag gaa gga gat aag gte att att att tec tac aaa 
Ala Arg Leu Val Gin Glu Gly Asp Lys Val He He He Ser Tyr Lys 
640 645 650 

atg atg tct gat caa gaa gcg gca age cat gag ccg aaa gtg get gtt 
Met Met Ser Asp Gin Glu Ala Ala Ser His t51u Pro Lys Val Ala Val 
655 660 665 '670 

ctg aat gat caa aac aaa att gaa -caa atg ctg ggg aac gaa cca gee 
Leu Asn Asp Gin Asn Lys He Glu Gin Met Leu Gly Asn Glu Pro Ala 
€75 680 685 



1874 



1922 



1971 



2019 



2067 



gta caa att gtg aat aat aat aat gga gca cgt ctt gaa acg tat att 2115 
Val Gin He Val Asn Asn Asn Asn -Gly Ala Arg Leu Glu Thr Tyr He 
610 615 620 

att cct ggt aaa egg gga age ggc gtc ata tgc tta aac ggt gca -gee 2163 
He Pro Gly Lys Arg Gly Ser Gly Val He «Cys Leu Asn -Gly Ala Ala 
625 630 635 



2211 



2259 



2307 



cgt aca att ttg tagaagaaaa gcccccttta tcgggggttt tcttttaaga tttt 2363 
Arg Thr He Leu 
690 



<210> 60 
<211> 293 
<212> PRT 
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<213> Bacillus subtilis 
<400> 60 

Met Ser He Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu 
1 5 10 15 

Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr He Tyr He Pro Leu 
20 25 30 

Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys 
35 40 45 

His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys He Pro Phe 
50 55 60 

He He Gly lie Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala 
65 70 75 80 

Arg lie Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val 
85 90 95 

Ser Leu He Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys 
100 105 110 

Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val 
115 120 125 

Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser 
130 135 140 

Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly 
145 150 155 160 

Val Phe Glu Val Val Glu Gin Ala Asp He Val He He Glu Gly He 
165 170 175 

Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg 
180 185 190 

He Phe Val Ser Asp Phe Phe Asp Phe Ser He Tyr Val Asp Ala Glu 
195 200 205 

Glu Ser Arg He Phe Thr Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg 
210 215 220 

Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp 
225 230 235 240 

Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser He Trp Glu Ser 
245 250 255 

Val Asn Arg Pro Asn Leu Tyr Glu Asn He Leu Pro Thr Lys Phe Arg 
260 265 270 

Ser Asp Leu He Leu Arg Lys Gly Asp Gly His Lys Val Glu Glu Val 
275 280 285 



Leu Val Arg Arg Val 
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<210> 61 

<211> 281 - 

<212> PRT 

<213> Bacillus subtilis 

<400> 61 

Met <51u Gly Leu Asn Asp Tyr Leu Ser Val Glu <Uu Val <Glu Thr lie 
1 5 10 15 

Tyr He Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala 
20 25 30 

Glu Arg Asn Lys His Val Asn Val Phe Leu Lys His Pro His Ser Ala 
35 40 45 

t 

Lys He Pro Phe lie lie Gly He Ala Gly Ser Val Ala Val <51y Lys 
50 55 60 

Ser Thr Thr Ala Arg He Leu Gin Lys Leu rLeu Ser Arg Leu Pro Asp 
65 70 75 80 

Arg Pro Lys Val Ser Leu He Thr Thr Asp -Gly Phe *Leu Phe .Pro Thr 
85 90 95 

Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys -Gly Phe Pro Glu 
100 105 110 

Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp i>eu Lys Ser 
115 120 125 

Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp 
130 135 140 

Arg Glu Glu Gly Val Phe Glu Val Val Glu -Gin Ala Asp He Val He 
14 5 150 155 160 

He Glu Gly He Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg 
165 170 175 

Glu Asn Pro Arg He Phe Val Ser Asp Phe Phe Asp Phe Ser He Tyr 
180 1B5 190 

Val Asp Ala Glu Glu Ser Arg He Phe Thr Trp Tyr Leu "Glu Arg Phe 
195 200 205 

• Arg Leu Leu Arg Glu Thr Ala Phe -Gin Asn Pro Asp Ser Tyr Phe His 
210 215 220 

Lys Phe Lys Asp Leu Ser Asp *Gln <Slu Ala Asp Glu Met Ala Ala Ser 
225 230 235 240 

He Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn lie Leu Pro 
245 250 255 

Thr Lys Phe Arg Ser Asp Leu He Leu Arg Lys Gly Asp <Uy His Lys 
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260 265 270 

Val Glu Glu Val Leu Val Arg Arg Val 
275 280 

'<210> 62 
<211> 1092 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (1089) 

<400> 62 

atg act aaa caa aca att cgc gtt gaa ttg aca tea aca aaa aaa ccg 48 
Met Thr Lys Gin Thr lie Arg Val Glu Leu Thr Ser Thr Lys Lys Pro 
15 10 15 

aaa cca gac cca aat cag ctt teg ttc gga aga gtg ttt aca gac cac 96 
Lys Pro Asp Pro Asn Gin Leu Ser Phe Gly Arg Val Phe Thr Asp His 
20 25 30 

atg ttt gta atg gac tat gec gca gat aaa ggt tgg tac gat cca aga 144 
Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg 
35 40 " 45 

ate att cct tat caa ccc tta tea atg gat cca act gca atg gtc tat 192 
lie He Pro Tyr Gin Pro Leu Ser Met Asp Pro Thr Ala Met Val Tyr 
50 55 60 

cac tac ggc caa ace gtg ttt gaa ggg tta aag get tac gtg tea gag 240 
His Tyr Gly Gin Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu 
65 70 75 80 

gat gac cat gtt ctg ctt ttc aga ccg gaa aaa aat atg gaa cgc ctg 288 
Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu 
85 90 95 

aat caa tea aac gac cgc etc tgc ate ccg caa att gat gaa gaa cag 336 
Asn Gin Ser Asn Asp Arg Leu £ys He Pro Gin He Asp Glu Glu Gin 
100 105 110 

gtt ctt gaa ggc tta aag cag ctt gtc gca att gat aaa gac tgg att 384 
Val Leu Glu Gly Leu Lys Gin Leu Val Ala He Asp Lys Asp Trp He 
115 120 * 125 

cca aat gcg gag ggc acg tec ctt tac ate cgt ccg ttc ate ate gca 432 
Pro Asn Ala Glu Gly Thr Ser Leu Tyr He Arg Pro Phe He He Ala 
130 135 140 

acc gag cct ttc ctt ggt gtt gcg gca tct cat acg tat aag etc ttg 480 
Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu 
145 150 155 160 

ate att ctt tct ccg gtc ggc tct tat tac aaa gaa ggc att aag ccg 528 
He lie Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly He Lys Pro 
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•62 



1€5 



170 



175 



gtc aaa ate get gtt gaa agt gaa ttt gtc cgt geg gta aaa ggc gga 
Val Lys He Ala Val Glu Ser Glu Phe Val Arg Ala Val Lys Gly Cly 
180 185 190 



576 



aca gga aat gee aaa ace gea gga aac tat get tea age tta aaa geg 
Thr <51y Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala 
195 200 205 



€24 



cag cag gta gee gaa gag aaa gga ttt tet eaa gta etc tgg ctg gac 
Gin Gin Val Ala Glu Glu Lys -Gly Phe Ser -Gin Val Leu Trp Leu Asp 
210 215 220 



€72 



ggc att gag aag aaa tae ate gaa gaa gtc gga age atg aac ate ttc 
Cly lie Glu Lys Lys Tyr He Glu <Glu Val *Gly Ser Met Asn He Phe 
225 * 230 235 240 



720 



ttc aaa ate aac ggt gaa ate gta aca ccg atg ctg aac ggg age ate 
Phe Lys He Asn Gly Glu He Val Thr Pro Met Leu Asn -Gly Ser He 
24 5 250 255 



7€8 



ctg gaa ggc att aeg cgc aat tea gtc ate gee ttg ctt aag cat tgg 
Leu Glu Gly He Thr Arg Asn Ser Val He Ala Leu Leu Lys His Trp 
260 265 270 



816 



ggc ctt caa gtt tea gaa cga aaa att geg ate gat gag -gt-c ate eaa 
Gly Leu Gin Val Ser Glu Arg Lys He Ala lie Asp <Uu Val He Gin 
275 280 285 



364 



gee cat aaa gac ggc ate ctg gaa gaa gee ttc gga aca ggt aca gea 
Ala His Lys Asp Gly He Leu Glu Glu Ala Phe Gly Thr «Gly Thr Ala 
290 295 300 



912 



get gtt att tec cca gtc ggc gag ctg ate tgg cag gat gaa aca -ctt 
Ala Val He Ser Pro Val Gly Glu Leu He Trp Gin Asp <Uu Thr Leu 
305 310 315 320 



960 



teg ate aac aac ggt gaa aca gga gaa ate gea aaa aaa eta tat gac 
Ser He Asn Asn Gly Glu Thr Gly Glu He Ala Lys Lys Leu Tyr Asp 
325 330 335 



1008 



acg att aca ggc att caa aaa ggc get gtc -gea gac gaa t*c gga tgg 
Thr He Thr Gly He Gin Lys Gly Ala Val Ala Asp Glu Phe -Gly Trp 
340 345 350 



1056 



acg ace gaa gtc gea geg ctg act gaa age aag taa 
Thr Thr Glu Val Ala Ala Leu Thr Glu Ser Lys 
355 360 



1092 



<210> 63 
<211> 363 
<212> PRT 

<213> Bacillus subtilis 



<400> 63 

Met Thr Lys Gin Thr He Arg Val Glu Leu Thr Ser Thr Lys Lys Pro 
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15 10 15 

Lys Pro Asp Pro Asn Gin Leu Ser Phe Gly Arg Val Phe Thr Asp His 
20 25 30 

Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg 
' 35 40 45 

lie lie Pro Tyr Gin Pro Leu Ser Met Asp Pro Thr Ala Met, Val Tyr 
50 55 60 

His Tyr Gly Gin Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu 
65 70 75 80 

Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu 
85 90 95 

Asn Gin Ser Asn Asp Arg Leu Cys He Pro Gin He Asp Glu Glu Gin 
100 105 110 

Val Leu Glu Gly Leu Lys Gin Leu Val Ala He Asp Lys Asp Trp lie 
115 120 125 

Pro Asn Ala Glu Gly Thr Ser Leu Tyr He Arg Pro Phe He He Ala 
130 135 140 

Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu 
145 150 155 160 

lie He Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly He Lys Pro 
165 ~ 170 175 

Val Lys He Ala Val Glu Ser Glu Phe Val Arg Ala Val Lys Gly Gly 
180 185 190 

Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala 
195 200 205 

Gin Gin Val Ala Glu Glu Lys Gly Phe Ser Gin Val Leu Trp Leu Asp 
210 215 220 

Gly He Glu Lys Lys Tyr lie Glu Glu Val Gly Ser Met Asn lie Phe 
225 230 235 240 

Phe Lys lie Asn Gly Glu He Val Thr Pro Met Leu Asn Gly Ser lie 
245 250 255 

Leu Glu Gly lie Thr Arg Asn Ser Val lie Ala Leu Leu Lys His Trp 
260 265 270 

Gly Leu Gin Val Ser Glu Arg Lys He Ala He Asp Glu Val lie Gin 
275 280 285 

Ala His Lys Asp Gly lie Leu -Glu Glu Ala Phe Gly Thr Gly Thr Ala 
290 295 300 

Ala Val lie Ser Pro Val Gly Glu Leu lie Trp Gin Asp Glu Thr Leu 
305 310 315 320 
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Ser He Asn Asn Gly Glu 'Thr Gly Glu He Ala Lys Lys Leu Tyr Asp 
325 330 335 

Thr He Thr Gly He Gin Lys Gly Ala Val Ala Asp Glu Phe Gly Trp 
34 0 34 5 350 

Thr Thr Glu Val Ala Ala Leu Thr Glu Ser Lys 
355 360 



<210> 64 
<211> 1071 
<212> DNA 

<213> Bacillus subtilis 
<220> 

<221> CDS , 
<222> (1) . . (1068) 

<400> 64 

ttg aat aag ctt att gaa cga gaa aaa act gta tat tat aag gaa aag 4*8 

Met Asn Lys Leu He Glu Arg Glu Lys Thr Val Tyr Tyr iys Glu Lys 

1 5 10 IS 

ccc gac ccg tct tec ttg ggg ttt gga caa tat ttt aca -gat tat atg 96 
Pro Asp Pro Ser Ser Leu Gly Phe Gly Gin Tyr Phe Thr Asp Tyr Met 

20 -25 30 

ttt gtg atg gac tac gaa gag ggg at± gga tgg cat 'cat ccg aga att 14 4 
Phe Val Met Asp Tyr Glu -Glu Gly lie Gly Trp His His Pro Arg lie 
35 40 45 

gcg ccg tac gca ccg ctt acg ctt gat ccg tct tea tct gtt ttt cat 192 
Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His 
50 55 60 

tac ggc cag get gtt ttt gaa gga tta aaa gca tac aga aca gac gac 240 
Tyr Gly Gin Ala Val Phe Glu Gly Led Lys Ala Tyr Arg Thr Asp Asp 
65 70 75 SO 

ggc agg gtg ctg ctg ttc cgt ccg gat caa aat ate aaa egg etg aac 288 
Gly Arg Val Leu Leu Phe Arg Pro Asp Gin Asn lie Lys *Arg Leu Asn 
85 90 95 

aga teg tgt gag cgc atg age atg ccc cct tta gac gaa gag ctg gtg 336 
Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu Glu Leu Val 
100 105 110 

ctt gag gca ttg acg caa tta gtt gag ctg gag aaa gat -tqq gtt cea 384 
Leu Glu Ala Leu Thr Gin Leu Val Glu Leu Glu Lys Asp Trp Val Pxo 
115 120 125 

aag gaa aaa gga acg tea ctg tat att egt -cct ttt gtc att gec aca 432 
Lys Glu Lys Gly Thr Ser Leu Tyr He Arg Pro Phe Val He Ala Thr 
130 " 135 140 

gaa ccg agt etc ggt gtg aag gca tee agg age tat aca ttt atg ate 480 
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Glu Pro Ser Leu Gly Val Lys Ala Ser Arg Ser Tyr Thr Phe Met lie 

145 150 155 160 

gtg ctt teg cct gtc ggc tec tat tat ggc gac gat cag ctg aag ccg 528 

Val Leu Ser Pro Val Gly Ser Tyr Tyr -Gly Asp Asp Gin Leu Lys Pro 

165 170 175 

i 

gtt aga ate tat gtc gaa gat gag tat gtg agg gcg gtc aac gga gga 576 

Val Arg lie Tyr Val Glu Asp Glu Tyr Val Arg Ala Val Asn Gly Gly 
180 185 190' 

gtc ggg ttt gca aaa acg get gga aac tat gec gec agt ett cag gca 624 

Val Gly Phe Ala Lys Thr Ala Gly Asn Tyr Ala Ala Ser Leu Gin Ala 
195 200 205 

cag egg aaa gcg aat gaa ctg ggc tat gac cag gta ctg tgg ctg gac 612 

Gin Arg Lys Ala Asn Glu Leu Gly Tyr Asp Gin Val Leu Trp Leu Asp 

210 215 220 

gec ate gaa aag aaa tat gtg gaa gaa gta ggg age atg aac ate ttt 720 

Ala He Glu Lys Lys Tyr Val Glu Glu Val Gly Ser Met Asn He Phe 

225 230 235 ( 240 

' ttc gtc ata aac ggg gaa get gtc aca cct get tta age gga age att 768 

Phe Val He Asn Gly Glu Ala Val Thr Pro Ala Leu Ser Gly Ser He 

245 250 255 

tta age ggg gtt aca cgt gcg tct gcg att gaa ttg att cga age tgg 816 

Leu Ser Gly Val Thr Arg Ala Ser Ala He Glu Leu He Arg Ser Trp 
260 265 270 

ggc att ccg gtt cgt gaa gag aga ata teg att gat gag gtg tat <gcg 864 

Gly He Pro Val Arg Glu Glu Arg He Ser He Asp Glu Val Tyr Ala 
275 280 285 

gec tct gca cgc gga gaa ttg aca gag gtc ttt ggc aca ggc acg gca 912 

Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala 

290 295 300 

gca gtc gtt acg cct gtc ggt gaa etc aac ate cat gga aaa acg gtg 960 

Ala Val Val Thr Pro Val Gly Glu Leu Asn He His Gly Lys Thr Val 

305 310 315 " ^ 320 

att gta ggc gac ggg caa ate ggg gac etc teg aaa aag ctg tat gaa 1008 

He Val Gly Asp Gly Gin He Gly Asp Leu Ser Lys Lys Leu Tyr Glu 

325 330 ~ " 335 

acg ata aca gat att cag ctt ggc aag gta aaa ggc ccg ttt aac tgg 1056 

Thr He Thr Asp He Gin Leu Gly Lys Val Lys Gly Pro Phe Asn Trp 
340 345 350 

aca gtg gaa gtg tga 1071 
Thr Val Glu Val 
355 



<210> 65 
<211> 356 
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<212> PfcT 

<213> Bacillus subtilis . 
<400> 65 

Met Asn Lys Leu lie Glu Arg Glu Lys Thr Val Tyr Tyr Lys Glu Lys 
1 5 10 15 

Pro Asp Pro Ser Ser Leu Gly Phe <51y Gin Tyr Phe Thr Asp Tyr Wet 
20 25 30 

Phe Val Met Asp Tyr Glu Glu -Gly He Gly Trp His His Pro Arg lie 
35 40 45 

Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro "Ser Ser Ser Val Phe His 
50 55 60 

Tyr Gly Gin Ala Val Phe Glu -Gly Leu Lys Ala Tyr Arg Thr Asp Asp 
65 70 75 ( 30 

Gly Arg Val Leu Leu Phe Arg Pro Asp Gin Asn lie Lys Arg Leu Asn 
85 90 95 

Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu -Glu Leu Val 
100 105 HO 

Leu Glu Ala Leu Thr Gin Leu Val Glu Leu Glu Lys Asp Trp Val Pro 
115 ' ' 120 125 

Lys Glu Lys Gly Thr Ser Leu Tyr He Arg Pro Phe Val He Ala Thr 
130 135 140 

Glu Pro Ser Leu Gly Val Lys Ala Ser Arg "Ser Tyr Thr Phe Met He 
145 150 155 1*0 

Val Leu Ser Pro Val -Gly Ser Tyr Tyr Gly Asp Asp Gin Leu Lys Pro 
165 170 175 

Val Arg He Tyr Val Glu Asp Glu Tyr Val Arg Ala Val Asn Gly Gly 
180 185 190 

Val Gly Phe Ala Lys Thr Ala Gly Asn Tyr Ala Ala Ser Leu Gin Ala 
195 200 205 

Gin Arg Lys Ala Asn Glu Leu Gly Tyr Asp Gin Val Leu Trp Leu Asp 
210 215 220 

Ala He Glu Lys Lys Tyr Val Glu Glu Val <Gly Ser Met Asn He Phe 
225 ' 230 235 240 

Phe Val He Asn Gly Glu Ala Val Thr Pro Ala Leu Ser Gly Ser He 
24 5 250 255 

Leu Ser Gly Val Thr Arg Ala Ser Ala lie -Glu Leu lie Arg Ser Trp 
260 265 270 

Gly He Pro Val Arg Glu Glu Arg He Ser He Asp Glu Val Tyr Ala 
275 280 -285 
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Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala 
290 295 300 

Ala Val Val Thr Pro Val Gly Glu Leu Asn He His Gly Lys Thr Val 
305 310 315 ' 320 

' He Val Gly Asp Gly -Gin He Gly Asp Leu Ser Lys Lys Leu Tyr Glu 
325 330 335 

Thr He Thr Asp He Gin Leu Gly Lys Val Lys Gly Pro Phe Asn Trp 
340 34 5 350 

Thr Val Glu Val 
355 



<210> 66 
<211> 1428 
<212> DMA 

<213> Bacillus subtilis 

<220> ( 

<221> CDS 

<222> (1) . . (1425) 

<400> 66 

atg tta aac ggc caa aaa gaa tat cgc gtg gaa aaa gac ttc ctt ggg 48 

Met Leu Asn Gly Gin Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly 
1 5 10 ' 15 

gaa aaa caa att gaa gca gat gtt tat tac gga att cag acg etc cgt 96 
Glu Lys Gin He Glu Ala Asp Val Tyr Tyr Gly He Gin Thr Leu Arg 
20 25 30 

get tct gaa aat ttt ccg ate aca gga tac aaa ate cat gag gaa atg 144 
Ala Ser Glu Asn Phe Pro He Thr Gly Tyr Lys He His Glu Glu Met 
35 40 45 

att aac gca ctg gcg att gtg aaa aaa get gcg get ctt gee aac atg 192 
He Asn Ala Leu Ala He Val Lys Lys Ala Ala Ala Leu Ala Asn Met 
50 55 60 

gac gtg aaa egg ctg tat gaa gga att ggc caa get ate gta caa gee 240 
Asp Val Lys Arg Leu Tyr Glu Gly He Gly Gin Ala He Val Gin Ala 
65 70 75 80 

get gac gag att ctg gaa ggc aag tgg cac gat cag ttt ate gtc gat 288 
Ala Asp Glu He Leu Glu Gly Lys Trp His Asp Gin Phe He Val Asp 
85 90 95 

ccg att cag ggc ggt gee gga act tct atg aac atg aac gcg aat gag 336 
Pro He Gin Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu 
100 105 HO 



gtt ate gga aac egg gcg ctt gaa ate atg gga cat aaa aag gga gat 384 
Val He Gly Asn Arg Ala Leu Glu He Met Gly His Lys Lys Gly Asp 
115 120 125 
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tat ate cat tta agt cca aac aca cat gtg aac atg tea cag tct cag 
Tyr He His Leu Ser Pro Asn Thr His Val Asn Met 'Ser Gin Ser Gin 
130 135 140 

aac gat gtg ttc ccg act -get ate cat att tec aca ttg aag etc tta 
Asn Asp Val Phe Pro Thr Ala He His He Ser Thr Leu- Lys -Leu Leu 
145 ISO 155 1*0 

gaa aaa ctg ctg aaa aca atg gaa gat atg eat agt gtg ttt aaa caa 
Glu Lys Leu Leu Lys Thr Met *Glu Asp Met His Ser Val -Phe Lys Gin 
165 170 175 



432 



480 



528 



aaa gca cag gag ttt cac tet gtt att aaa atg ggc egg aca cac ctt 576 
Lys Ala Gin Glu Phe His Ser Val He Lys Met Gly Arg Thr His Leu 
180 185 190 

caa gat gcg gtt ccg ate cgt ctt ggc cag gaa ttc gaa get tac age 624 
Gin Asp Ala Val Pro He Arg Leu Gly Gin Glu Phe Glu Ala Tyr Ser 
195 200 205 

cgt gtt etc gag cgt gat ate aaa ega ate aag caa tCg cgc cag cac £72 
Arg Val Leu Glu Arg Asp He Lys Arg He Lys Gin Ser Arg Gin His 
210 215 220 

ctg tat gaa gtc aac atg ggc gca act get gtt ggt aca ggg ctg aac 
Leu Tyr Glu Val Asn Met Gly Ala Thr Ala Val Gly Thr Gly Leu Asn 
225 230 " 235 240 

get gat cct gaa tat ate aaa cag gta gta aag cac ctt ^fct gat att 
Ala *sp Pro Glu Tyr He Lys Gin Val Val Lys His Leu Ala Asp lie 
24 5 250 255 

age ggg ctt cct ctt gtc ggc get gat cat ctt gtt gat gcg aca caa 
Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gin 
260 265 270 

aat aca gat gec tat aca gag gta tea get tea tta aaa gtc tec atg 
Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Met 
275 280 285 

atg aac atg teg aag ate gca aac gac ctg cgc tta atg gcg teg gga 
Met Asn Met Ser Lys He Ala Asn Asp Leu Arg Leu Met Ala Ser -Gly 
290 295 300 

ccg cgc gec gga ctt gcg gaa att tct ctg cct gca cgt cag ccg <ggt 
Pro Arg Ala Gly Leu Ala Glu He Ser Leu Pro Ala Arg Gin Pro Gly 
305 ' 310 315 320 

tea tct att atg ccg ggg aaa gtc aat ccg gtt atg gcg gag ctg ate 
Ser Ser He Met Pro Gly Lys Val Asn Pro Val Met Ala Glu Leu He 
325 330 335 

aac caa att gcg ttc cag gtt ate gga aat -gac aat aca ate tgc ctt 
Asn Gin He Ala Phe Gin Val He Gly Asn Asp Asn Thr lie Cys Leu 
340 345 350 

get tea gaa gec ggc cag ctt gag ttg aac gtc atg. gag ccc gtg ctt 
Ala Ser Glu Ala Gly Gin Leu Glu Leu Asn Val Met Glu Pro Val Leu 



7^0 



7<68 



.816 



8-64 



912 



960 



1008 



1056 



1104 
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355 360 365 

gtc ttt aat ttg ctt caa, tec ate age ate atg aac aac ggc ttc cgt 1152 
Val Phe Asn Leu Leu Gin Ser He Ser He Met Asn Asn Gly Phe Arq 
370 375 380 

teg ttc act gac aac tgc tta aaa ggc att gaa gee aac gaa aag cgt 1^00 
Ser Phe Thr Asp Asn Cys Leu Lys Gly He Glu Ala Asn Glu Lys Arg 
3 85 390 395 40O 

atg aag caa tac gta gaa aaa age gca gge gtg ate aca get gtc aat 1248 
Met Lys Gin Tyr Val Glu. Lys Ser Ala Gly Val He Thr Ala Val Asn 
405 410 415 

ccg cat ctt ggg tat gaa gcg gca get aga att gee agg gaa gca att 1296 
Pro His Leu Gly Tyr Glu Ala Ala Ala Arg He Ala Arg Glu Ala lie 
420 425 430 

atg aca ggg caa tct gtc egg gat ctt tgt ctg cag cat gat gtg ctg 134 4 
Met Thr Gly Gin Ser Val Arg Asp Leu Cys Leu Gin His Asp Val Leu 
435 440 445 

act gaa gaa gaa ttg gat att att tta aac cca tat gag atg acc aaa 1392 
Thr Glu Glu Glu Leu Asp He He Leu Asn Pro Tyr Glu Met Thr Lys 
450 455 460 

cca ggt ate gca ggg aaa gaa eta tta gaa aaa taa 1428 
Pro Gly He Ala Gly Lys, Glu Leu Leu Glu Lys 
465 470 475 

<210> 67 

<211> 475 

<212> PRT 

<213> Bacillus subtilis 

<400> 67 

Met Leu Asn Gly Gin Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly 
1 5 10 15 

Glu Lys Gin He Glu Ala Asp Val Tyr Tyr Gly He Gin Thr Leu Arg 
20 25 30 

Ala Ser Glu Asn Phe Pro He Thr Gly Tyr Lys He His Glu Glu Met 
35 40 45 

He Asn Ala Leu Ala He Val Lys Lys Ala Ala Ala Leu Ala Asn Met 
50 55 60 

Asp Val Lys Arg Leu Tyr Glu Gly He Gly Gin Ala He Val Gin Ala 
65 70 75 80 

Ala Asp Glu He Leu Glu Gly Lys Trp His Asp Gin Phe He Val Asp 
85 90 95 

Pro He Gin Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu 
100 105 no 
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Val He Gly Asn Arg Ala Leu Glu He Wet Gly His Lys Lys Gly Asp 
115 120 125 

Tyr He His Leu Ser Pro Asn Thr His Val Asn Met Ser -Gin Ser Gin 
130 135 140 

Asn Asp Val Phe Pro Thr Ala He His He Ser Thr Leu Lys Leu Leu 
145 150 155 . 160 

Glu Lys Leu Leu Lys Thr Met Glu Asp Met His Ser Val Phe Lys Gin 
165 170 175 

Lys Ala Gin Glu Phe His Ser Val He Lys Met Gly Ar<g Thr Hrs Leu 
180 185 190 

Gin Asp Ala Val Pro He Arg Leu Gly Gin Glu Phe <31u Ala Tyr Ser 
195 200 205 

Arg Val Leu Glu Arg Asp He Lys Arg He Lys Gin Ser Arg 'Gin His 
210 215 220 

i 

Leu Tyr Glu Val Asn Met Gly Ala Thr Ala Val Gly Thr Gly Leu Asn 
225 230 235 240 

Ala Asp Pro Glu Tyr He Lys Gin Val Val Lys His Leu Ala Asp He 
24 5 250 255 

Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gin 
260 265 270 

Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val -Cys Wet 
275 280 28S 

Met Asn Met Ser Lys He Ala Asn Asp Leu Arg Leu Met Ala Ser <31y 
290 295 300 

Pro Arg Ala Gly Leu Ala Glu He Ser Leu -Pro Ala Arg Gin Pro Gly 
305 310 315 320 

Ser Ser He Met Pro Gly Lys Val Asn Pro Val -Met Ala <31u Leu He 
325 330 335 

Asn Gin He Ala Phe Gin Val He <31y Asn Asp Asn Thr He -Cys Leu 
340 345 350 

Ala Ser Glu Ala Gly Gin Leu Glu Leu Asn Val Met Glu Pro Val Leu 
355 360 365 

Val Phe Asn Leu Leu Gin Ser lie Ser lie Met Asn Asn Gly Phe Arg 
370 375 380 

Ser Phe Thr Asp Asn Cys Leu Lys Gly He .Glu Ala Asn Glu Lys Arg 
385 390 395 400 

Met Lys Gin Tyr Val Glu Lys Ser Ala Gly Val He Thr Ala Val Asn 
405 410 415 

Pro His Leu Gly Tyr Glu Ala Ala Ala Arg He Ala Arg <Glu Ala He 
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420 425 430 

Met Thr Gly Gin Ser Val, Arg Asp Lea Cys Leu Gin His Asp Val Leu 
435 440 445 

Thr Glu Glu Glu Leu Asp lie lie Leu Asn Pro Tyr Glu Met Thr Lys 
450 455 460 

Pro Gly He Ala Gly Lys Glu Leu Leu Glu Lys 

475 



4 65 


470 


<210> 


68 


<211> 


768 


<212> 


DNA 


<213> 


Bacillus subtilis 


<220> 




<221> 


CDS 


<222> 


(1) . . (765) 


<400> 


68 


atg aaa cga gaa age aac 


Met Lys Arg Glu Ser Asn 


1 


5 



t caa gtg etc age cgt ggt caa aaa gat 48 
e Gin Val Leu Ser Arg Gly Gin Lys Asp 
10 15 

cag cct gtg age cag att tat caa gta tea aca atg act tct eta tta 96 
Gin Pro Val Ser Gin He Tyr Gin Val Ser Thr Met Thr Ser Leu Leu 
20 25 30 

gac gga gta tat gac gga gat ttt gaa ctg tea gag att ccg aaa tat 144 
Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu He Pro Lys Tyr 
35 40 45 

gga gac ttc ggt ate gga ace ttt aac aag ctt gac gga gag ctg att 192 
Gly Asp Phe Gly He Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu He 
50 55 60 

ggg ttt gac ggc gaa ttt tac cgt ctt cgc tea gac gga ace gcg aca 240 
Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr 
65 70 75 80 

ccg gtc caa aat gga gac cgt tea ccg ttc tgt tea ttt acg ttc ttt 288 
Pro Val Gin Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe 
85 90 95 

aca ccg gac atg acg cac aaa att gat gcg aaa atg aca cgc gaa gac 336 
Thr Pro Asp Met Thr His Lys He Asp Ala Lys Met Thr Arg Glu Asp 
100 105 HO 

ttt gaa aaa gag ate aac age atg ctg cca age aga aac tta ttt tat 384 
Phe Glu Lys Glu He Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr 
115 120 125 

gca att cgc att gac gga ttg ttt aaa aag gtg cag aca aga aca gta 432 
Ala He Arg He Asp Gly Leu Phe Lys Lys Val Gin Thr Arg Thr Val 
130 135 140 
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gaa ctt caa .gaa aaa cct tac gtg eca at^ gtt gaa gcg gtc aaa aca 4*80 

Glu Leu Gin <51u Lys Pro Tyr Val Pro Met Val -Glu Ala Val Lys Thr 
145 ISO 155 160 

eag ccg att ttc aac ttc gac aac gtg aga gga acg att gta ggt tt-c 
Gin Pro He Phe Asn Phe Asp Asn Val Arg Gly Thr He Val Gly Phe 
165 170 175 

ttg aca cca get tat gca aac gga ate gec gtt tct -ggc tat cac ctg 
Leu Thr Pro Ala Tyr Ala Asn <51y lie Ala Val Ser Gly Tyr His Leu 
180 185 190 

cac ttc att gac gaa gga cgc aat tea ggc gga «cac gtt ttt gac tat *624 
His Phe He Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr 
195 * 200 205 

gtg ctt gag gat tgc acg gtt acg att tct caa aaa atg aac atg aat 672 
Val Leu Glu Asp Cys Thr Val Thr He Ser Gin Lys Met Asn Met Asn 
210 215 220 



etc aga ctt ccg aac aca gcg gat ttc ttt aat gcg aat ctg gat aac 
,Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn 
*225 230 235 240 



528 



57* 



720 



cct gat ttt gcg aaa gat ate gaa aca act gaa gga age cct -gaa taa 766 
Pro Asp Phe Ala Lys Asp He Glu Thr Thr Glu Gly Ser Pro <Glu 
24 5 » 250 • 255 



<210> 69 
<211> -255 
<212> PRT 

<213> Bacillus subtilis 
<400> 69 

Met Lys Arg Glu Ser Asn He Gin Val Leu Ser Arg "Gly Gin Lys Asp 
1 5 10 15 

Gin Pro Val Ser Gin He Tyr Gin Val Ser Thr Met Thr Ser Leu Leu 
20 25 30 

Asp Gly Val Tyr Asp Gly Asp Phe Glu -Leu Ser -Glu He Pro Lys Tyr 
35 40 45 

Gly Asp Phe Gly He Gly Thr Phe Asn Lys I.eu Asp Gly -Glu Leu He 
50 " 55 60 

Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr 
65 70 75 80 

Pro Val Gin Asn Gly Asp Arg Ser -Pro Phe Cys Ser Phe Thr Phe Phe 
85 90 95 

Thr Pro Asp Met Thr His Lys He Asp Ala Lys Met Thr Arg -Glu Asp 
100 • 105 HO 

Phe Glu Lys Glu He Asn Ser Met Leu Pro Ser Arg Asn Leu £»he Tyr 
115 120 125 
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Ala He Arg He Asp Gly Leu Phe Lys Lys Val Gin Thr Arg Thr Val 
130 ,135 140 

Glu Leu Gin Glu Lys Pro Tyr Val Pro Met Val Glu Ala Val Lys Thr 
145 150 155 160 

Gin Pro He Phe Asn Phe Asp Asn Val Arg Gly Thr He Val Gly Phe 
165 170 " 175 

Leu Thr Pro Ala Tyr Ala Asn Gly He Ala Val Ser Gly Tyr His Leu 
180 185 190 

His Phe He Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr 
195 200 205 

Val Leu Glu Asp Cys Thr Val Thr He Ser Gin Lys Met Asn Met Asn 
210 215 220 

Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn 
225 230 235 240 

Pro Asp Phe Ala Lys Asp He Glu Thr Thr Glu Gly Ser Pro Glu 
245 250 " 255 



<210> 70 

<211> 1254 

<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (1251) 

<400> 70 

atg aca ttc tec ctt ttt ggt gac aaa ttt acc cgc cac tec ggc att 48 

Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly He 

1 5 10 15 

acg ctg ttg atg gaa gat ctg aac gac ggt tta cgc acg cct ggc gcg 96 
Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala 
20 25 30 

att atg etc ggc ggc ggt aat ccg gcg cag ate ccg gaa atg cag gac 14 4 
He Met Leu Gly Gly Gly Asn Pro Ala Gin He Pro Glu Met Gin Asp 
35 40 45 

tac ttc cag acg eta ctg acc gac atg ctg gaa agt ggc aaa gcg act 192 
Tyr Phe Gin Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr 
50 55 60 

gat gca ctg tgt aac tac gac ggt cca cag ggg aaa acg gag eta etc 240 
Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gin Gly Lys Thr Glu Leu Leu 
65 70 75 80 

aca ctg ctt gee gga atg ctg cgc gag aag ttg ggt tgg gat ate gaa 288 
Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp He Glu 
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85 90 , 95 

cca cag aat att gca eta aea aac ^gc age cag a^c geg ttt ttc tac 33* 
Pro Gin Asn He Ala Leu Thr Asn Gly Ser Gin Ser Ala *>he Phe Tyr 
, 100 105 HO 

tta ttt aac ctg ttt gcc gga cgc cgt ^cc gat c.gt egg gte aaa aaa 384 
Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys 
115 120 125 

gtg ctg ttc ccg ctt gca ccg gaa tac att ggc tat get gac gcc gga 4 32 
Val Leu Phe Pro Leu Ala Pro Glu Tyr He Gly Tyr Ala Asp Ala Gly 
130 135 140 

ctg gaa gaa gat ctg ttt gtc tct geg egt ccg aat att gaa ctg ctg 48.0 
Leu Glu Glu Asp Leu Phe Val Ser Ala Arg Pro Asn He «Glu Leu Leu 
14 5 150 155 160 

ccg gaa ggc cag ttt aaa tac cac gtc gat ttt gag cat ctg cat att 52B 
Pro Glu Gly Gin Phe Lys Tyr His Val Asp Phe Glu His Leu His He 
165 1*?0 1 175 

ggc gaa gaa acc ggg atg att tgc gtc tec egg ccg acg aat cca aca 576 
Gly Glu Glu Thr Gly Met He Cys Val Ser Arc. Pro Thr Asn Pro Thr 
180 185 ISO 

ggc aat gtg att act gac gaa gag' ttg ctg aag ctt qac ' *gcg ct<g -ggc 624 
Gly Asn Val 11^ Thr Asp Glu Glu Leu Leu Lys «Leu Asp Ala -Leu Gly 
195 200 205 

aat caa cac ggc att ccg ctg gtg att gat aac get tat ggc gtc ccg -672 
Asn Gin His Gly He Pro Leu Val He Asp Asn Ala Tyr Gly Val Pro 
210 215 220 



ttc ccg ggt ate ate ttc agt gaa geg cgc ccg eta tgg aat ccg aat 
Phe Pro Gly He He Phe Ser Glu Ala Arg Pro Leu Trp Asn Pro Asn 
225 230 235 240 

ate gtg ctg tgc atg agt ctt tec aag ctg ggt eta cct ggc 4:cc cgc 
He Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg 
24 5 250 255 

tgc ggc att ate ate gee aat gaa aaa ate ate acc cec ate acc aat 
Cys Gly He He He Ala Asn Glu Lys He He Thr Ala He Thr Asn 
260 265 270 

atg aac ggc att ate age ctg gca cct ggc ggt att -ggt ccg geg atg 
Met Asn Gly He He Ser Leu Ala Pro Gly Gly He Gly Pro Ala Met 
275 280 285 

atg tgt gaa atg att aag cgt aac gat ctg ctg cgc ctg tct -gaa aca 
Met Cys Glu Met He Lys Arg Asn Asp Leu Leu Arg Leu -Ser Glu Thr 
290 295 300 

gtc ate aaa ccg ttt tac tac cag cgt gtt cag gaa act ate gcc ate 
Val He Lys Pro Phe Tyr Tyr Gin Arg Val Gin Glu Thr lie Ala lie 
305 310 315 320 
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att cgc cgc tat tta ccg gaa aat cgc tgc ctg att cat aaa ccg gaa , 1008 

He Arg Arg Tyr Leu Pro Glu Asn Arg Cys Leu He His Lys Pro Glu 
325 , 330 335 

gga gcc att ttc etc tgg eta tgg ttt aag gat ttg ccc att acg acc 1056 

Gly Ala He Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro He Thr Thr 

340 345 350 

aag cag etc tat cag cgc ctg aaa gca cgc ggc gtg ctg atg gtg ccg 1104 

Lys Gin Leu Tyr Gin Arg Leu Lys Ala Arg Gly Val Leu Met Val Pro 
355 360 365 

ggg cac aac ttc ttc cca ggg ctg gat aaa ccg tgg ccg cat acg cat 1152 

Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His 
370 375 380 

cea tgt atg cgc atg aac tac gta cca gag ccg gag aaa att gag gcg 1200 

Gin Cys Met Arg Met Asn Tyr Val Pro Glu Pro Glu Lys He Glu Ala 
385 390 395 ~ 400 

ggg gtg aag att ctg gcg gaa gag ata gaa aga gcc tgg get gaa agt 1248 

Gly Val Lys He Leu Ala Glu Glu He Glu Arg Ala Trp Ala Glu Ser 
405 410 415 



cac taa 
His 



<210> 71 
<211> 417 
<212> PRT 

<213> Escherichia coli 
<400> 71 

Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly lie 
1 5 10 15 

Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala 
20 25 30 

He Met Leu Gly Gly Gly Asn Pro Ala Gin He Pro Glu Met Gin Asp 
35 40 45 

Tyr Phe Gin Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr 
50 55 60 

Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gin Gly Lys Thr Glu Leu Leu 
65 70 75 80 

Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp He Glu 
85 90 95 

Pro Gin Asn He Ala Leu Thr Asn Gly Ser Gin Ser Ala Phe Phe Tyr 
100 105 no 

Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys 
115 120 " 125 
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Val Leu Phe Pro Leu Ala Pro X31u Tyr He Gly Tyr Ala Asp Ala -Gly 
130 135 140' 

Leu Glu Glu Asp Leu Phe Val Ser Ala Arg Pro Asn He <51u Leu Leu 
^45 150 155 1*0 

Pro Glu Gly Gin Phe Lys Tyr His Val Asp Phe t31u -His Leu His He 
165 170 ,175 

Gly Glu Glu Thr -Gly Met Il« Cys Val Sfer Arg Pro Thr Asn Pro Thr 
180 185 130 

Gly Asn Val He Thr Asp Glu Glu Leu Leu Lys Leu Asp Ala Leu Gly 
195 200 205 

Asn Gin His Cly He Pro Leu Val He Asp Asn Ala Tyr Gly Val *ro 
210 215 220 

Phe Pro Gly He He Phe Ser Glu Ala Arg Pro Leu Trp Asn Pro Asn 
225 230 235 240 

i 

,He Val Leu Cys Met Ser Leu Ser Lys Leu -Gly Leu Pro £ly Ser Arg 
245 250 255 

Cys Gly He He lie Ala Asn «Glu Lys He lie Thr Ala He Thr Asn 
260 2«65 270 

Met Asn Gly He He Ser Leu Ala Pro <51y "Gly He tSly -Pro Ala Met 
275 280 235 

Met Cys Glu Met He Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr 
290 295 300 

Val He Lys Pro Phe Tyr Tyr Gin Arg Val ^ln Glu Thr He Ala He 
305 310 315 320 

He Arg Arg Tyr Leu Pro Glu Asn Arg Cys ieu He His Lys Pro <31u 
325 330 335 

Gly Ala He Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro He Thr Thr 
340 345 350 

Lys Gin Leu Tyr Gin Arg Leu Lys Ala Arg <Gly Val Leu Met Val Pro 
355 360 365 

Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His 
370 375 380 

Gin Cys Met Arg Met Asn Tyr Val Pro <Glu Pro £lu Lys He XSlu Ala 
385 390 395 400 

Gly Val Lys He Leu Ala Glu Glu He Glu Arg Ala Trp Ala <Slu Ser 
405 410 415 



His 
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<210> 72 
<211> 8803 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN294 plasmid 

<400> 72 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 240 
gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatat<^aaa aggaattgtt 300 
tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggctgaag aagccatcac 360 
gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420 
catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480 
gcatacttat gcgcggctga aggagcccat gtcctgctgt cggctagacg cgaggatcgt 540 
ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct 600 
ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660 
gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacattggat 720 
gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780 
cttccgcaaa tgcttgagca aaaaaaggga catatcatca atatcgcctc tcaagcgggg 840 
aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatgccgt gttaggttac 900 
tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960 
ggcccgattc agacggactt tttttccatt gctgataaag gcggggacta cgccaaaaat 1020 
gtcggccgct ggatgcttga tcctgatgac gtggcagctc aaattacagc tgcaattttt 1080 
acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa gctgtatcag 1140 
ctgtttccag ctcttgtaga aaagctggca ggacgcgcgc tcatgaaaaa ataatgatag 1200 
aactgcctgt ggtggagtgg cttgtttctc acggggcagt ttttgatagt ggaagggaga 1260 
gattgttgaa tgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320 
cagaaactga ttgatgcaga gcctgtattc tgggcaaatt caggcaagaa agaggggcca 1380 
ttaccccgtg cagatgagtg ggcaaccgag atagcggaag cggaaaaaag aatgcagcgg 14 40 
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tttgcacctt acattgccga ^ggtgtttcct gagacgaaag gcgctaaagg aatcatcgag 1500 

tctccgcttt ttgaggtgca gcatatgaag ggaaagctgg aagcggcata tcagcagcca 1560 

tttcccggaa gatggctttt aaagtgcgac catgagcttc cgatttcagg atcgattaaa 1620 

gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct tcaagaagga 1*680 

atgcttcagg aaaccgatga t^accgcatc ttacaggaag agcggtttac cgggt-tAtt-c 1740 

tcccgctatt cgattgctgt cggttcgaca ggaaatctag gt<t*aagcat cggcat<:atc 1800 

ggcgcggcac tcgggtttcg cgtgacagtg catatgt-ccg ocgatgctaa gcagtggaaa I860 

aaggatctcc tccgccaaaa gggagtcact gttatggagt acgaaacaga ttacagtgaa 1920 

gcggtgaacg aagggagacg gcaggcggaa caagattrcat tctgttattt tattgatgat 1980 

i 

gaacattctc gtcagctgtt cttaggatat gctgttgctg caagccgatt aaaaacacag 2040 
cttgactgta tgaatataaa gccaagtctt gagacgccct tgtttgtgta tctgccgtgc 2100 
ggagtcggcg gaggaccggg cggtgtagca tttgggctga agcttt.tata cggagatigat 2160 
gttcatgtgt ttttcgcaga accaact<:at tcaccttgta tgctgttagg gctttatt-ca 2220 
ggacttcacg agaagatctc cgtccaggat a.tcggcctgg ataafccagac ggctgctgac 22«80 
ggacttgccg tagggaggcc gtcaggatt-t gt-cggcaagc tgattgaacc gcttctgagc 2340 
ggctgttata cggtagagga caatacgctt tatactt<tgc ttcatatgct ggctgtat^ct 2400 
gaagataaat atttagagcc ctctgctctt gctggcatgt tcgggccggt tcagcttttt 2460 
tcgacagaag agggaaggcg ctatgct<:ag aaatataaga tggaacatgc <:gtacatgtc 2520 
gtctggggaa cgggaggaag -catggttcca aaagatgaaa tggctgcgta taaccgaatc 2580 
ggtgctgatt tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttttccgt 5640 
tacaatcttt gtaagggttt taacctacag agagfccaggt gtaaacagtg aaaaataaag 2700 
aacttaacct acatacttta tatacacagc acaatcggga gtcttggtct ggttttgggg 2760 
ggcatttgtc gattgctgta tctgaagaag aggcaaaagc -tgtggaagga ttgaatgat* 2820 
atctatctgt tgaagaagtg gagacgatct atatt<^cgct tgttcgcttg cttcatttac 2880 
atgtcaagtc tgcggctgaa cgcaataagc atgtcaatgt ttttttgaag <:acccacatt 2940 
cagccaaaat tccgtttatt atcggcattg ccggcagtgt cgcagtcgga aaaagcacga 3000 
cggcgcggat cttgcagaag ctgctttcgc gtttgcctga ccgtccaaaa gtgagcctta 3060 
tcacgacaga tggtttttta tttcctactg ccgagctgaa aaagaaaaat atgatgtcaa 3120 
gaaaaggatt tcctgaaagc tatgatgtaa aggcgctgct -cgaatttttg aatgaottaa 3180 
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aatcaggaaa ggacagcgta aaggccccgg 
aaggtgtgtt cgaggttgta gaacaggcgg 
ttcagtcgcc caccttggag gatgaccggg 
1 ttgatttttc gatttatgtg gatgcggagg 
gttttcgcct gcttcgggaa acagcttttc 
aagacttgtc cgatcaggag gctgacgaga 
ggccgaattt atatgaaaat attttgccaa 
agggagacgg gcataaggtc gaggaagtgt 
tcgagcaata gttaccctta ttatcaagat 
aatcctttaa aaaaacacaa aagaccacat 
agcacccatt agttcaacaa acgaaaattg 
ttatgttaca gtaatattga cttttaaaaa 
gtaagcctcc taaattcact ttagataaaa 
aaaattgatt tagacaattg gaagagaaaa 
acgactttta gtataaccac agaaattgat 
gaaggatata aattttaccc tgcatttatt 
acagctttta gaactggtta caatagcgac 
ccactttata caatttttga tggtgtatct 
aagaatgact tcaaagagtt ttatgattta 
tcggggaaat tgtttcccaa aacacctata 
ccatggactt catttactgg gtttaactta 
cccattatta cagcaggaaa attcattaat 
ttacaggtac atcattctgt ttgtgatggt 
caggaattgt cagataggcc taatgactgg 
ctttttacag tcggttttct aatgtcacta 
tattacagct gtcgactcgt gatcttcgga 
cagctgcgct tttcggtttt tcgcatactt 
ggcaaatata ataaatacaa acagctgaaa 
ctcatgtttg cgggagagat tcattctctt 
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tgtattccca tctaacctat gaccgcgagg 3240 
atattgtgat tattgaaggc attaatgttc 3300 
aaaacccgcg tatttttgtt tccgatttct 3360 
aaagccggat tttcacttgg tatttagagc 3420 t 
aaaatcctga ttcatatttt cataaattta 3480 
tggcagcctc gatttgggag agtgtcaacc 3540 
ctaaattcag gtcagatctc attttgcgta 3600 
tggtaaggag ggtatgaaat gtgctgcagc 3660 
aagaaagaaa aggatttttc gctacgctca 3720 
tttttaatgt ggtctttatt cttcaactaa 37 80 
gataaagtgg gatattttta aaatatatat 3840 
aggattgatt ctaatgaaga aagcagacaa 3900 
atttaggagg catatcaaat gaactttaat 3960 
gagatattta atcattattt gaaccaacaa 4020 
attagtgttt tataccgaaa cataaaacaa 4080 
ttcttagtga caagggtgat aaactcaaat 4140 
ggagagttag gttattggga taagttagag 4 2.00 
aaaacattct ctggtatttg gactcctgta 4 260 
tacctttctg atgtagagaa atataatggt 4320 
cctgaaaatg ctttttctct ttctattatt 4380 
aatatcaata ataatagtaa ttaccttcta 4 4 40 
aaaggtaatt caatatattt accgctatct 4500 
tatcatgcag gattgtttat gaactctatt 4560 
cttttataat atgagataat gccgactgta 4 620 
acctgccccg ttagttgaag aaggttttta 4 680 
caggctgttc agctttttct caatgcgatc 4740 
gaagcctgta acagccgcaa agacgacagc 4800 
catcacatca cctatattca tgttcttcac 4860 
ccgtttttta tttaaagcgg cttttccaga 4 920 
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cgggaacggt gttttgt<ggt ctccattttc atttgccgat aggcgaacgc taaaaatggc 49S0 
aggccgagca gggtaatgcc -gctcaggaca gaaaaaatat aaa<tcggccg gccagcgcca 504 0 
aacaggtcta tacatatccc cccgacccaa gggccgatga cgttt-ccgag ctgtggaaaa 5LQ0 
ccgattgccc cgaaataagt gccttttaat cctggttttg caatctggtc -tacatacaaa -51-60 
tccatcatag agaataaaag cacttcgccg attgtaaatg -tgatgacaat catcacaat* 5220 
gatggaacac cgtgtgatac ggtgaaaatg gccatgct-ga tgctaaccat cacattaccg 5280 
agcatcagag aacaaagcgg cgaaaaccgt ttt-gcaaaat ggacaatggg aaattgcgtc 5340 
gccaacacaa cgattgcgtt taatgt-cagc atcagcccat acagcttcgt tccattgccg 5400 
atcaaggggt tctgcgccat atactgaggg aatgt-ggaac tgaattgtga gtagccgaag 5460 
gtgcatagcg taatgccgac caaagcaatg gtaaaaagat aatccttttg cgtgaccata 5520 
aacgcttccc gcacgctcat atttcgggac <tgggctggtg otgataagga tggatgtttt 5580 
ttaaattgga gggcaagcac aattccgtat agtjecgtaaa tgactgcagg caccaaaaag 5640 
ggcgtagtcg attgcgatga gccgaaatat aggccaagca caggtccgaa gacaaogccg 5700 
atattaatag ccgcatagcg taa'attaaaa actagcagtc t<:gtttt«:tc t-tctgtcata 5760 
tcagacaaca aggcctttga agcgggctca aacagtgatt tgcaaagacc gtttaatgcg 5820 
tttactacaa aaaacaccca gagattagat gctgccgcaa agcctgcaaa «taccagcatc 5880 
catccgaaaa tcgatacaag catcatgttt tttctgccga atttatctga gatatatccg 5940 
ccgtaaaagc ttgcgaggat gccgactgat gagctcgcgg -cgatgaccag ccctgcatag 6000 
gaagctgatg cgccttggac ggctgt<:aaa taaatcgcta aaaaaggaat gctcafccgat 6060 
gttgccattc tgccgaaaat ggttccgatt ataat-tgtac gcgttggatg -catagcttga 6120 
gtattctata gtgt-caccta aatagcttgg cgtaatxratg gtcatagctg tttcctgtgt 6180 
gaaattgtta tccgctcaca attccacaca acatacgagc -cggaa^cata aagtgtaaag 62 40 
cctggggtgc ctaatgagtg agctaactca cattaattgc •gttgcgctca ctgcccgctt 6300 
tccagtcggg aaacctgtcg tgccagctgc attaatgaat -cggccaacgc gcggggagag 6360 
gcggtttgcg tattgggcgc tcttccgctt cctcgct*cac tgactcgctg *cgctcggtcg 6420 
ttcggctgcg gcgagcggta tcagct-cact caaaggcggt aatacggtta tccacagaat 6480 
caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 6540 
aaaaggccgc gttgctggcg tttttcgata ggctocgccc <:cctgacgag <:atcacaaaa 6600 
atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac ca^gcgtttc 6660 
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actgatcttc 


agcatctttt 


act t t caeca 


8100 


gcgtttctgg 


gtgagcaaaa 


acaggaaggc 


aaaatgccgc 


aaaaaaggga 


ataagggega 


8160 


cacggaaatg 


ttgaatactc 


atactcttcc 


tttttcaata 


ttattgaagc 


atttatcagg 


8220 


gttattgtct 


catgagcgga 


tacatatttg 


aatgtattta 


gaaaaataaa 


caaatagggg 


8280 


ttccgcgcac 


atttccccga 


aaagtgccac 


ctgtatgcgg 


tgtgaaatac 


cgcacagatg 


8340 


cgtaaggaga 


aaataccgca 


tcaggcgaaa 


ttgtaaacgt 


taatattttg 


ttaaaattcg 


8400 
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ogttaaatat ttgttaaatc agct<atttt ttaaccaata ggccgaaatc ggcaaaatcc ^^O 
cttataaatc aaaagaatag accgagatag ggttgagtgt bgtt<rcagtt tggaacaaga -8520 
gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg 8560 
atggcccact acgtgaacca tcacccaaat xraagtttttt gcggtcgagg tgccgtaaag 8-640 
ctctaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga aagccggcga 87O0 
acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg ctggcaagtg 87*60 
tagcggtcac gctgcgcgta accaccacac ccgccgcgct taa -8803 

<210> 73 
<211> 8320 

<212> DNA , 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN296 plasmid 

<400> 73 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag <:tggcgaaag ggggatgtgc tgcaaggc^fa 120 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 240 
gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 300 
tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggctgaag aagccatcac 360 
gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggt<gaaa 420 
catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480 
gcatacttat gcgcggctga aggagcccat gtcctgctgt -cggctagacg cgaggatcgt 540 
ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct -600 
ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660 
gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacatt<gga-t 720 
gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780 
cttccgcaaa tgcttgagca aaaaaaggga catat-catca atatcgcctc tcaagcgggg 840 
aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatgccgt gttaggttac 900 
tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960 
ggcccgattc agacggactt ttttfcccatt gctgataaag gcggggacta <:gccaaaaat 1020 
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gtcggccgct ggatgcttga tcctgatgac 
acgaaaaagc gggagatcaa t<:ttccgcgt 
ctgtttccag ctcttgtaga aaagctggca 
aactgcctgt ggtggagtgg cttgtttctc 
gattgttgaa tgtcagttca ttcagaagtc 
cagaaactga ttgatgcaga gcctgtattc 
ttaccccgtg cagatgagtg ggcaaccgag 
tttgcacctt acattgccga ggtgtttcct 
tctccgcttt ttgaggtgca gcatatgaag 
tttcccggaa gatggctttt aaagtgcgac 
gcgaggggcg ggatttatga agtgttaaag 
atgcttcagg aaaccgatga ttaccgcatc 
tcccgctatt cgattgctgt cggttcgaca 
ggcgcggcac tcgggtttcg cgtgacagtg 
aaggatctcc tccgccaaaa gggagtcact 
gcggtgaacg aagggagacg gcaggcggaa 
gaacattctc gtcagctgtt cttaggatat 
cttgactgta tgaatataaa gccaagtctt 
ggagtcggcg gaggaccggg cggtgtagca 
gttcatgtgt ttttcgcaga accaactcat 
ggacttcacg agaagatctc cgtccaggat 
ggacttgccg tagggaggcc gtcaggattt 
ggctgttata cggtagagga caatacgctt 
gaagataaat atttagagcc ctctgctctt 
tcgacagaag agggaaggcg ctatgctcag 
gtctggggaa cgggaggaag catggttcca 
ggtgctgatt tgctaaaaaa acgaaatgga 
tacaatcttt gtaagggttt taacctacag 
aacttaacct acatacttta tatacacagc 



gtggcagctc aaattacagc tgcaattttt 1080 
ttaatgaatg ccggcactaa gctgtatcag 1140 
ggacgcgcgc tcatgaaaaa ataatgatag 1200 
acggggcagt ttttgatagt ggaagggaga 1260 
cttcatgctc tgcttaaaga ' tccgtttatt 1320 
tgggcaaatt caggcaagaa agaggggcca 1380 
atagcggaag cggaaaaaag aatgcagcgg 14 4 0 
gagacgaaag gcgctaaagg aatcatcgag 1*500 
ggaaagctgg aagcggcata tcagcagcca 1560 
catgagcttc cgatttcagg at-cgattaaa 1620 
tatgctgaaa atctcgcgct tcaagaagga 1680 
ttacaggaag agcggtttac cgggtttttc 1740 
ggaaatctag gtttaagcat cggcatcatc 1800 
catatgtccg ccgatg[Ctaa gcagtggaaa 1860 
gttatggagt acgaaacaga ttacagtgaa 1920 
caagatccat tctgttattt tattgatgat 1980 
gctgttgctg caagccgatt aaaaacacag 2040 
gagacgccct tgtttgtgta tctgccgtgc 2100 
tttgggctga agcttttata cggagatgat 2160 
tcaccttgta tgctgttagg gctttattca 2220 
atcggcctgg ataatcagac ggctgctgac 2280 
gtcggcaagc tgattgaacc gcttctgagc 2340 
tatactttgc ttcatatgct ggctgtatct 24O0 
gctggcatgt tcgggccggt tcagcttttt 24 60 
aaatataaga tggaacatgc cgtacatgtc 2520 
aaagatgaaa tggctgcgta taaccgaatc 2580 
aaataagcag acagtgaaaa ggttttccgt 2640 
agagtcaggt gtaaacagtg aaaaataaag 2700 
acaatcggga gtcttctgca gctcgagcaa 27 60 
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tagttaccct tattatcaag ataagaaaga aaaggatttt tcgctacgct caaatcct-tt 2820 
aaaaaaacac aaaagaccac attttttaat gtggtcttta ttcttcaact aaagcaccca 2880 
ttagttcaac aaacgaaaat tggataaagt -gggatati:t<: taaaatatat atttatgtta 2940 
cagtaatatt gacttttaaa aaaggatt'ga ttctaatgaa gaaagcagac aagtaagcct 3000 
cctaaattca ctttagataa aaatttagga ggcatatcaa atgaacttta ataaaattga 3060 
tttagacaat tggaagagaa aagagatatt taatcattat *tgaaccaac aaacgact^t 3120 
tagtataacc acagaaattg atattagtgt tttataccga aacataaaac aagaaggata 3180 
taaattttac cctgcattta ttttcttagt gacaagggtg ataaactcaa atacagctt* 3240 
tagaactggt tacaatagcg acggagagtt aggttattgg gataagttag agccacttta 3300 
tacaattttt gatggtgtat ctaaaacatt ctctggtatt tggactcctg taaagaatga 3360 
cttcaaagag ttttatgatt tatacctttc £gatgtagag aaatataatg gttcggggaa 3420 
attgtttccc aaaacaccta tacctgaaaa tgctt<tt>tct ctttotatta tt<x:atggac 34 80 
ttcatttact gggtttaact taaatatcaa taataatagt aattaccttc tacccattat 3540 
tacagcagga aaatt-catta ataaaggtaa ttcaatatat ttaecgctat ctttacaggt 3600 
acatcattct gtttgtgatg gttatcatgc aggattgttt atgaactcta ttcaggaatt 3*660 
gtcagatagg cctaatgact ggcttttata atatgagata atgccgactg tactttttac 3720 
agtcggtttt ctaatgtcac taacctgccc cgttagbtga agaaggtttt tatattacag 3780 
ctgtcgacta aggtcgagga agtgttggta aggagggtat gaaatgtgca toatattgaa 3840 
ctgtatgtct ctgatttgga ggcgtctagg cggttttggg gctggttctt aaaagaactt 3900 
ggttataaag agtatcaaaa atggagctca ggcatcagct ggaagaaaga togtttttac 3960 
ctagtgattg tgcaggcgaa agagccattt ctagagccgg aataccatag atgccgagtc 4020 
ggtctgaacc atctcgcatt tcatgctgaa tccaagcttc aagtcgatca gatgactgaa 4080 
aaattgacgg caaaaggcta tcgtgtgttg taccgagaca -ggcatccttt tgccggagga 4140 
gacgggcatt atgcagtctt ttgtgaggat ccagaccgga ttaaggtaga gctcgttgcc 4200 
ccaagctgtt aatcgtgatc ttcggacagg ctgtt-cagct ttttctcaat gcgatccagc 4-2-60 
tgcgcttttc ggtttttcgc atacttgaag cctgtaacag ccgcaaagac gacagcggca 4 320 
aatataataa atacaaacag ctgaaacatc acafccaccta tattcatgtt -cttcaoctca 4 380 
tgtttgcggg agagattcat tctcttccgt tttttattta aagcggcttt tccagacggg 4440 
aacggtgttt tgtggtctcc attttcattt gccgataggc gaacgctaaa aatggcaggc 4 500 



BNS0CW0: <WO 0121772A2 I > 



WO 01/21772 



-85 - 



PCT7US00/25993 



cgagcagggt aatgccgctc aggacagaaa 
ggtctataca tatccccccg acccaagggc 
ttgccccgaa ataagtgcct tttaatcctg 
tcatagagaa taaaagcact tcgccgattg 
gaacaccgtg tgatacggtg aaaatggcca 
tcagagaaca aagcggcgaa aaccgttttg 
acacaacgat tgcgtttaat gtcagcatca 
aggggttctg cgccatatac tgagggaatg 
atagcgtaat gccgaccaaa gcaatggtaa 
cttcccgcac gctcatattt cgggactggg 
attggagggc aagcacaatt ccgtatagtc 
tagtcgattg cgatgagccg aaatataggc 
taatagccgc atagcgtaaa ttaaaaacta 
acaacaaggc ctttgaagcg ggctcaaaca 
ctacaaaaaa cacccagaga ttagatgctg 
cgaaaatcga tacaagcatc atgttttttc 
aaaagcttgc gaggatgccg actgatgagc 
ctgatgcgcc ttggacggct gtcaaataaa 
ccattctgcc gaaaatggtt ccgattataa 
ttctatagtg tcacctaaat agcttggcgt 
attgttatcc gctcacaatt ccacacaaca 
ggggtgccta atgagtgagc taactcacat 
agtcgggaaa cctgtcgtgc cagctgcatt 
gtttgcgtat tgggcgctct tccgcttcct 
ggctgcggcg agcggtatca gctcactcaa 
gggataacgc aggaaagaac atgtgagcaa 
aggccgcgtt gctggcgttt ttcgataggc 
gacgctcaag tcagaggtgg cgaaacccga 
ctggaagctc cctcgtgcgc tctcctgttc 



aaatataaat cggccggcca gcgccaaaca 4 560 
cgatgacgtt tccgagctgt ggaaaaccga 4 620 
gttttgcaat ctggtctaca tacaaatcca 4 680 
taaatgtgat gacaatcatc acaattgatg 4740 
tgctgatgct aaccatcaca ttaccgagca 4 800 
caaaatggac aatgggaaat tgcgtcgcca 4860 
gcccatacag cttcgttcca ttgccgatca 4 920 
tggaactgaa ttgtgagtag ccgaaggtgc 4 98C 
aaagataatc cttttgcgtg accataaacg 5040 
ctggtgctga taaggatgga tgttttttaa 5100 
cgtaaatgac tgcaggcacc aaaaagggcg 5160 
caagcacagg tccgaagaca acgccgatat 5220 
gcagtctcgt tttttcttct gtcatatcag 5280 
gtgatttgca aagaccgttt aatgcgttta 5340 
ccgcaaagcc tgcaaatacc agcatccatc 5400 
tgccgaattt atctgagata tatccgccgt 54 60 
tcgcggcgat gaccagccct gcataggaa<j 5520 
tcgctaaaaa aggaatgctc atcgatgttg 5580 
ttgtaacgcg ttggatgcat agcttgagta 5640 
aatcatggtc atagctgttt cctgtgtgaa 5700 
tacgagccgg aagcataaag tgtaaagcct 5760 
taattgcgtt gcgctcactg cccgctttcc 5820 
aatgaatcgg ccaacgcgcg gggagaggcg 5880 
cgctcactga ctcgctgcgc tcggtcgttc 594 0 
aggcggtaat acggttatcc acagaatcag €000 
aaggccagca aaaggccagg aaccgtaaaa 6060 
tccgcccccc tgacgagcat cacaaaaatc 6120 
caggactata aagataccag gcgtttcccc 6180 
cgaccctgcc gcttaccgga tacctgtccg 6240 
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cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt 6300 
cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 6360 
gctgcgcctt atccggtaac tafcegtcttg agtccaaccc ggtaagacac <gacttatogc 6420 
cactggcagc agccactggt aacaggatta ^cagagcgag gtatgtaggc ggtgctacag €480 
agttcttgaa gtggtggcct aactacggct acactagaag -gacagtattt ggta<x:tgcg 654*0 
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag -ctcttga4>cc ggcaaacaaa €600 
ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 6660 
gatctcaaga agatcctttg afrcttttcta cggggtctga cgct<:agtgg aacgaaaact 6720 
cacgttaagg gattttggtc atgagattat caaaaaggat cttcacct,ag atccttttaa €780 
attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt -6840 
accaatgctt aatcagtgag gcacctatct cagcgatctg tctattfccgt tcatccatag 6900 
ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca 6960 
gtgctgcaat gataccgcga gacccacgct caceggctcc agatttatca gcaataaacc 7020 
agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc *ccabccagt 7080 
ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 7140 
ttgttggcat tgctacaggc atcgtggtgt <:acgctcgtc gtttggtatg gctt«:attca 7200 
getccggttc ccaacgatca aggcgagtta -catgatcccc catgttgcgc aaaaaagcgg 7260 
ttagctcctt cggtcctccg at<cgttgtca gaagtaagtt ggccgcagtg ttatcactca 7 320 
tggttatggc agcactgcat aattctctta ctgtcat-gcc atccgtaaga tgcttttctg 7380 
tgactggtga gtactcaacc aagtcattct gagaataccg .cgcccggcga cogagttgct 74 40 
cttgcccggc gtcaatacgg gataatagtg tatgacatag <*gaact*ta aaagtgctca 7500 
tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca 7560 
gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg 7620 
tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 7680 
ggaaatgttg aatactcata ctcttccttt t-tcaatatta ttgaagcatt tetcagggtt 7740 
attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc 7800 
cgcgcacatt tccccgaaaa gtgccacctg *atgcggt^t gaaataccgc acagatgcgt 78-60 
aaggagaaaa taccgcatca ggcgaaattg taaacgttaa tattttgtta aaattcgcgt 7920 
taaatatttg ttaaatcagc tcatttttta accaataggc cgaaatcggc aaaatccctt 7980 
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ataaatcaaa agaatagacc gagatagggt tgagtgttgt tccagtttgg aacaagagtc 8040 

cactattaaa gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgato, 8100 

gcccactacg tgaaccatca cccaaatcaa gttttttgcg gtcgaggtgc cgtaaagctc 8160 

taaatcggaa ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg 8220 

tggcgagaaa ggaagggaag aaagcgaaag gagcgggcgc tagggcgctg gcaagtgtag 8280 

cggtcacgct gcgcgtaacc accacacccg ccgcgcttaa 8 320 



<210> 74 
<211> 250 
<212> PRT 

<213> Clostridium acetobutylicum 
<400> 74 

Asn Lys Arg Ala Ala Phe Met Leu Leu Leu Phe Leu Arg Ser Val Leu 
1 5 10 , 15 

Lys Val He Leu Val Leu Asp Val Gly Asn Thr Asn He Val Leu Gly 
20 25 30 

He Tyr Asn Asp Thr Lys Leu Thr Ala Glu Trp Arg Leu Ser Thr Asp 
35 4 0 45 

Val Leu Arg Ser Ala Asp Glu Tyr Gly He Gin Val Mat Asn Leu Phe 
50 55 60 

Gin Gin Asp Lys Leu Asp Pro Thr Leu Val Glu Gly Val He He Ser 
65 70 75 80 

Ser Val Val Pro Asn He Met Tyr Ser Leu Glu His Met He Arg Lys 
85 90 95 

Tyr Phe Lys He Asn Pro Leu Val Val Gly Pro Gly He Lys Thr Gly 
100 105 ' 110 

He Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He 
115 120 125 

Val Asn Ala Val Ala Ala His Glu He Tyr Lys Arg Ser Leu He He 
130 135 140 

He Asp Phe Gly Thr Ala Thr Thr Phe Cys Ala Val Arg Glu Asn Gly 
145 150 " 155 " 160 

Asp Tyr Leu Gly Gly Ala He Cys Pro Gly He Lys Val Ser Ser Glu 
165 170 175 

Ala Leu Phe Glu Lys Ala Ala Lys Leu Pro Arg Val <31u Leu lie Lys 
180 185 190 

Pro Ala Tyr Ala He Cys Lys Asn Thr He Ser Ser He Gin Ser Gly 
195 200 205 
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He Val Tyr Arg Tyr Leu Arg Cln Val Lys Tyr Leu* -Phe .Glu Lys Leu 
210 215 220 

Lys Glu Asn Leu Pro Asp t^ly Arg Arg Thr Arg T-hr Ser Leu Val Leu 

225 230 * 235 240 

Ala Thr Gly Gly Leu Ala Lys Leu He Asn f 
245 250 



<210>. 75 
<211> 258 
<212> PRT 

<213> Rhodobacter capsulatus 
<400> 75 

Met Leu Leu Cys He Asp Cys *Gly Asn Thr Asn Thr Val Phe ^Ser Val 
1 5 * 10 15 

Trp Asp Gly Thr Asp Phe Ala Ala Thr Trp Arg He Aia Thr Asp His 
20 25 30 

Arg Arg Thr Ala Asp Glu Tyr Phe Val Trp Leu Asn Thr Leu Met Cln 
35 40 45 

Leu Lys Gly Leu Gin Gly Arg He. Ser Glu Ala lie. He Ser Ser Thr 
50 55 60 

Ala Pre Arg Val Val Phe Asn Leu Arg Val Leu Cys Asn Arg Tyr Phe 
65 70 75 8C 

Asp Cys Arg Pro Tyr Val Val Cly Lys P*ro Gly -Cys Glu Leu Pro Val 
85 90 95 

Ala Pro Arg Val Asp Pro Gly Thr Thr Val -Gly Pro Asp Arg Leu Val 
100 105 11-0 

Asn Thr Val Ala Gly Tyr Asp Arg His Gly Gly Asp Leu He Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Asp Val Val Ala Pro Asp Gly Ala 
130 135 140 

Tyr He Gly Gly Val He Ala Pro Gly Val Asn Leu "Ser Leu ^Glu Ala 
145 150 155 160 

Leu His Met Ala Ala Ala Ala Leu Pro His Val Asp Val Thr Lys Pro 
165 170 175 

Gin Gly Val He Gly Thr Asn Thr Val Ala Cys He Gin Ser Cly Val 
180 185 190 

Tyr Trp Gly Tyr He Gly Leu Val Glu 131 y He Val Arg CLn He Arg 
195 200 205 

Met Glu Arg Asp Arg Pro Met Lys Val He Ala Thr -Gly -Gly Leu Ala 
210 215 220 
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Ser Leu Phe Asp Leu Gly Phe Asp Leu Phe Asp Lys Val Glu Asp Asp 
225 230 ( 235 240 

Leu Thr Met His Gly Leu Arg Leu lie Phe Asp Tyr Asn Lys Gly Leu 
245 250 ' 255 

Gly Ala 

<2K» 76 
<211> 10801 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN240 plasmid 

<400> 76 



gaattttgcg 


gccgcttcga 


aagctgtaat 


ataaaaacct 


tcttcaacta 


acggggcagg 


60 


ttagtgacat 


tagaaaaccg 


actgtaaaaa 


gtacagtcgg 


cattatctca 


tattataaaa 


120 


gccagtcatt 


aggcctatct 


gacaattcct 


gaatagagtt 


cataaacaat 


cctgcatgat 


180 


aaccatcaca 


aacagaatga 


tgtacctgta 


aagatagcgg 


taaatatatt 


gaattacctt 


240 


tattaatgaa 


ttttcctgct 


gtaataatgg 


gtagaaggta 


attactatta 


ttattgatat 


300 


ttaagttaaa 


cccagtaaat 


gaagtccatg 


gaataataga 


aagagaaaaa 


gcattttcag 


360 


gtataggtgt 


tttgggaaac 


aatttccccg 


aaccattata 


tttctctaca 


tcagaaaggt 


420 


ataaatcata 


aaactctttg 


aagtcattct 


ttacaggagt 


ccaaatacca 


gagaatgttt 


480 


tagatacacc 


atcaaaaatt 


gtataaagtg 


gctctaactt 


atcccaataa 


cctaactctc 


540 


cgtcgctatt 


gtaaccagtt 


ctaaaagctg 


tatttgagtt 


tatcaccctt 


gtcactaaga 


600 


aaataaatgc 


agggtaaaat 


ttatatcctt 


cttgttttat 


gtttcggtat 


aaaacactaa 


€60 


tatcaatttc 


tgtggttata 


ctaaaagtcg 


tttgttggtt 


caaataatga 


ttaaatatct 


720 


cttttctctt 


ccaattgtct 


aaatcaattt 


tattaaagtt 


catttgatat 


gcctcctaaa 


780 


tttttatcta 


aagtgaattt 


aggaggctta 


cttgtctgct 


ttcttcatta 


gaatcaatcc 


840 


ttttttaaaa 


gtcaatatta 


ctgtaacata 


aatatatatt 


ttaaaaatat 


cccactttat 


900 


ccaattttcg 


tttgttgaac 


taatgggtgc 


tttagttgaa 


gaataaagac 


cacattaaaa 


960 


aatgtggtct 


tttgtgtttt 


tttaaaggat 


ttgagcgtag 


cgaaaaatcc 


ttttctttct 


1020 


tatcttgata 


ataagggtaa 


ctattgaatt 


cggtaccaag 


agtttgtaga 


aacgcaaaaa 


1080 


ggccatccgt 


caggatggcc 


ttctgcttaa 


tttgatgcct 


ggcagtttat 


ggcgggcgtc 


1140 
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ctgcccgcca ccctccgggc cgttgcttcg 
tcctactcag gagagcgttc accgacaaac 
actgagcctt tcgttttatt tgatgcctgg 
acactaccat cggcgctacg gcgtttcact 
accgcgctac tgccgccagg caaattctgt 
atctgtatca ggctgaaaat cttct-ctcat 
cggcaaaacc gttttactct caaaatctta 
ttctacaaaa ttgtacgggc tggttcgttc 
ttcagaacag ccactttcgg ctcatggctt 
gaaataataa tgaccttatc tccttcctgc 
atgacgccgc ttccccgttt accaggaata 
ttattcacaa tttgtacttt ttcattagga 
tcaattgtaa tgcttcccac atagttcagg 
ttqccgctca tcattgttcg atacatatta 
gatattatct attaaacgcg cttttgaaaa 
agcaatttca ttcacaggct cgagttccgg 
tccgctagtc gtttcaatga -tatcttttgc 
accggcttgg acaagttccg cacttgtttg 
tctttcctca gctgttaagt atacattgcg 
gaccgtatcg acaggaacca att-caatatc 
aacagctacc tgctgcgcat cttttaaacc 
gaaaagcttc gtcagtacga tcgcgacccc 
taacacgtct gtgcgtcttt ctacatgaat 
atgagcatct ggcgtaaaaa gaatatcgac 
ctcaatatcg cgcggatatg ^ttcaaaatc 
aaaaatactc ataataacgg cgtcgttttc 
cccctcatgc agaaacccca tcgtcggaac 
ttgttttatg gcttctttca gctgtgaaat 
<:gtacaagcc gtcaagcact gtctggttca 
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caacgttcaa at<:cgctccc ggcggatttg 1200 
aacagataaa acgaaaggcc <^gtctttcg 1260 
•cagttoccta <rtctcgcatg gggagacccc 1320 
t<:tgagtt<g gcatggggtc aggtgggacc 1380 ' 
tttatcagac cgcttctgcg t^ctqattta 14 4 0 
ccgccaaaac aggatcctac ggaaatggag 1500 
aaagaaaacc cccgataaag nggggcttttc 1560 
occagcattt gttcaattt* gttttgatca 1620 
gccgcttctt gatcagacat cattttgtag 1*680 

acaaggcgtg *cggctgcacc gtttaagcat 1*7 40 

i 

atatacgttt «caagacgtgc tccattatta 1800 
agcattccca cagcatcaat gagatcttca 1860 
tttgcttocg taacagttgc oct<gtg[aagt 1920 
tattctotcc atttctcgaa -tatcaataat 1*990 
agcaactgca acagcgagaa fccatctttcc 2040 
ataggaataa agctctacat agtctatggt 2100 
agcttttatc accgctticag gatctxrtttc 2160 
aagggcccga tacagcttag gcgcttcttt 2220 
agagcttttg gctaagccgt cttcotctct 2280 
<:atgaagaag tcgctgatta acccatcaac 2340 
gaaataggca cgagtcggct tgactagatt 24O0 
gtcaaaatgt cctt-ctcttg agcgcccgca 2460 
^cgtgacattc ttttcaccgg gatacatatc 2520 
tccggcgttt t-ctgcaagag ctgcatcccg 2580 
tt-cattaggg ccgaattgtg caggattcac 2640 
ttgtcttgct ttgtctgcta aggttaaatg 27O0 
aaatccgatt gacttgccct -ctgaatggta 27^0 
afrcagtaatc tgtctcatct tattttcccc 2820 
tttgaaagga atgcttttgt tcagggaaag 2880 
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cacgatgtct 


tacatcctga 


acatatccgc 


gcgtatattg 


ctttacaaat 


ttaggtgttc 


aaacgagaac 


ttgtccgtcc 


gctttcacac 


gcgtctcggc 


aattttggct 


gtgagttctg 


ctcctgcttc 


ttcgcatttt 


atactgtctt 


tgccctgtac 


tttatagccg 


cccagtacgc 


tgactactgg 


aatgcctcca 


agcgtcaatg 


cctcaagctt 


cagtgcgtca 


gctccgcttt 


tatcttcctt 


agacaggtga 


taagacataa 


gcgcacccct 


tttaacggct 


tttgtatgat 


ttgaatcaag 


gccgaggacg 


accattccaa 


ccgcttgttc 


agcaagttta 


gctgccggat 


cttcagactc 


cttcattttt 


agaaaatcca 


gagcgtcctg 


ctgttgttaa 


gattattata 


ttgcaaaatt 


tttcaggaat 


tttagcagag 


ccgctcttgt 


gctgttagga 


tatctttctt 


ttggttaaaa 


ggaaacaaaa 


agaccgtttt 


tacagtcaca 


gccacttttg 


caaaaaccgg 


gtcgacgatg 


atctgccgtt 


ttcttctgca 


gattcttcac 


tttctaaagt 


tcggcgagtt 


caaggcagac 


tgctgcaatg 


tctatctatt 


tgcccaatgg 


aagtatacca 


aaatcaacgg 


atcggcagaa 


tagatttttt 


taatgccttc 


atctatacct 


aacgccttac 


cgtaaaaggt 


gccaataccg 


agcgcatagc 


tttcccataa 


ataatcccgg 


taccgtttct 


caaagcatag 


aattttttcc 


ccagcagctt 


ggctgaggct 


atcattaggc 


tgctggttaa 


cgttaatgcc 


ttcttcagcc 


tgcatttccg 


ttaggatacc 
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tgattgctgt ttcgatggtt tcatcaatgc 2940 
tctcaacacc gtggccgata ajtatcatgat 3000 
cagccccgat tccaatgacc ggtatgctta 3060 
ccggcacaca ttccagcaca agcatcatag 3120 
ctattaattt tttggcgctt tgttcgtctt 3180 
cgactgactg cggtgtcaaa cctaagtgac 3240 
cgcgaatgga ttcaaacacg ccttctccgc 3300 
cctgaacgat agccgctgca tttttcagcg 3360 
acggcatatc tgtcacaata aaggtattcg 3420 
ggatcatgtc cgcaactfgtc acaccgacag 3480 • 
gtgaatcacc gactaaaatc atgtcaactc 3540 
aatcataagc ggtcagcatg acaatcggtt 3600 
gttttgtttt catgttttct cctccttcta 3660 
ccacaccttg tagataaagt caacaacttt 3720 
gttgttctgg atgtagaaca aaacatcttt 3780 
ggaagctagg taggcctcga gttatggcag 3840 
cacacaaaac ggtctttttc gatttctttt 3900 
acagcttcat gccttataac tgctgtttcg 3960 
agccaaaaaa ccttccgtta caacgagaag 4020 
tqatccctct gtcccagtcc ttttttggat 4080 
ttaataatag gtgcagttcg caggcgatac 4140 
gcttgtacca acacattagc ccaattcgat 4200 
gttcgtttct aaaagcagaa cgccttcatc 4260 
tccgtttaac gttctggctc tcatattagt 4320 
aagcttaatc ggcgtaaatc cgtgcgtcat 4380 
taaaatatgc tggatgacgc cggcccgatc 44 40 
tgtcgcgatg tccttcaatt catctggaaa 4 500 
gatcccaatg atcactgaac gtacgcggtc 4560 
gactgttttt tttccgttaa tcaaaatatc 4620 
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atttggccat ttaatat>ccg tttggatgcc fcgctgcctct tctattccct gcacaacagc 4680 
tactgcagca agcagagtca gctgcggtgt tttttggagc ggaatgfccag ^ccgcaaaat 4 740 
caggctcatc caaacaccgt ttccttcttg agaatgccat accctagaca ttcggcccct 48O0 , 
tccggctgtt tgtttgtcag ccaccacaag ggtgccttcc ggtgcgttat ^attcgcgag 4€60" 
ctcatgagcc gttttttgcg tgcttgaaag aacgtcatgg taaataagat gct^gcccat 4920 
cacttccgtt tttaatccaa aacgaatttc gctttcactg a<gttttccgg gttttttgat 4980 
gagccgatat ccttttcttc taacggcttc tacttcataa cccfectttcc gaagctcttc 5040 
aatatgcttc cacacagcag t^cttgaaca gccgagagca tcactgattt tttggccgga 51O0 
aataaattca ttgccggcct gagaaaataa tt-caataagg tcttttctta atgtt^raccg 51-60 
catgtcttca gccactcctc tatgtgtttc ttttgattgg agagcttccc tgtcacaaca 52,20 
gcctgctcga tccactgtaa ttcttctgac acccattttc cggccggccg gtttcgaa-gc 5280 
gcaageaagt ccttacccgt gatatcaaga tccttaaggc ttttgatcgg caggttttga 534 0 
, taagcgtact gaatgtcctt cagtttcttt tcatccagtt tttcgttttg ccgaagctgc 5400 
gatattttgg ccgctgagag cagtgctttt ttcccagctc tgtacattgt cattgcgt-ca 54-60 
aggctctggc caaacgtatc ggcaatgtga atggcttcct tgafccacttt tcccgggagc 5520 
ttccaggctt tcaggaaaag gggcgcgtct ttcaaaacta -tgccaaggtt aa*taaaaga 5580 
gcagcccaaa gctcct<:acg ggatgttaaa gagaagaatg gaaactcact -cgttgaaatc 5640 
aggttttctc gtttatgata aaaaccagga agctcttcat acaatctcgt ttgaa4:gagt 5700 
gtttgaagcg cctggcgaga agctcttccc t^gcagcaatt tct<:aaactc tatagttttt 5760 
cgttcgactg aaacatggga gaggagt^gat ttttctttcg caatggcttc ttctgtttcc 5820 
ggtgaaagcg taaagccaag ctggctcata aagcgtacgg -ctctcagcat acgaagcgca 5880 
tcctcttgaa atctatcctc aggctttcca acggttcgaa t-cactttctg atcaatatct 5940 
ttcttgccgc caaaataatc aagcaccttc -ccgtccgctg tcatggccat cgcat.tgatc 6000 
gttaaatctc tgcgttttag atcctcttct aatgatgaga taaattgcac ttctgacggt 6060 
cttctgaaat caacataatc agattcagtc cggaatgtcg tgactfccata gcjtttcatcc 6120 
tcccagagca caataatggt cccgtgctct ttgcctacat caacagtccg ctgaaacagc 6180 
cgttctactt gatcaggtgc cgcatctgtc gcgata-tcga catctccgat ogttcgtttc -6240 
atatagctgt cacgaactgc gcccccgaca aaataagcct gatggcccgc ttcgattaag 6300 
atgcggagca cgggaagtgc tttgataaaa actttttcca tgtgatcact ccggttctgc ^360 
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taaatcggca taaatctgtt catactggct gacaattttt ttagaagaaa attcattttc 6420 
aagcatctct attgccgcct ttgtaaaacg attgcttagc tgttcatctt ctaaaatgct 6480 
catcgcgcgg gctgtt^cgg ccgtaacatc accgacatcc accaaaaatc cgctcacatt *540 
gttttttata acctcaggga taccgccaat gtttgttcca atacaaggca ctccgcaagc 6600 
catcgcttca agcaggacaa ggccaaagct ttctttttca gatagcagca gcttcaaatc 6660 
gctaatagaa taaagatctt caacacggtc ttgatttcca agcattaaga cttggtcttc 6720 
caagccatat tttctgataa gctcgcaggc tgtcgatttc tccggaccgt ctccgactaa 6780 
aagcagcttc gctttcgttt tgccagcgat attgcggaac acacggatga catcctgcac 6840 
gcgtgcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt 6900 
cttgaagtgg tggcctaact acggctacac tagaaggaca gtattt^gta tctgcgctct 6960 
gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 7020 
cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 7080 
tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 7140 
ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 7200 
aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 7260 
atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 7320 
ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 7380 
tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc 7440 
agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat 7500 
taattgttgc cgggaagcta gagtaagtag t.tcgccagtt aatagtttgc gcaacgttgt 7560 
tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc 7620 
cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 7680 
ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt 7740 
tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac 7800 
tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg 7860 
cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctoatcat 7920 
tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 7980 
gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc 8040 
tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa 8100 



WO 01/21772 



PCTYUS00/25993 



-94- 

atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg 81-60 
tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 8220 
cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca tgacattaac 8280 
ctataaaaat aggcgtatca cgaggccctt tcgtctcgca tgcggat<:ag. tgagggtttug 8340 
caactgcggg tcaaggatct ggatttcgat cacggcacga tcatcgtgcg ggagggcaag -8400 
ggctccaagg atcgggcctt gatgttaccc gagagcttgg -cacccagcct gcgcgagcag 84 60 
gggaattgat ccggtggatg accttttgaa tgacctttaa tagattatat tactaattaa 8520 
ttggggaccc tagaggtccc cttttttatt ttaaaaattt tttcacaaaa cggtttacaa 8580 
gcataacggg ttttgctgcc cgcaaacggg ctgttctggt gttgctagtt tgttatcaga 8640 
atcgcagatc cggcttcagg tttgccggct gaaagcgcta tttcttccag aattgccatg 8700 
■attttttccc cacgggaggc gtcactggct cccgtgttgt cggcagcttt gattcgataa 8760 
gcagcatcgc ctgtttcagg ctgtctatgt gtgactgttg agctgtaaca agttgt<:tca 8820 
-ggtgttcaat ttcatgttct agttgcttOg ttttactggt ttcacctgtt otattaggtg 8880 
ttacatgctg ttcatctgtt acattgtcga tctgttcatg gtgaacagct ttaaatgcac -€940 
caaaaactcg taaaagctct gatgtatcta tcttttttac accgttttca tctgtgcata 9000 
tggacagttt tccctttgat atctaacggt gaacagttgt t-ctacttttg tttgttagtc 9060 
ttgatgcttc actgatagat acaagagcca taagaacctc agatccttcc gtatt-tagcc 9120 
agtatgttct ctagtgtggt tcgttgtttt tgcgtgagcc atgagaacga accattgaga 91*80 
tcatgcttac tttgcatgtc actcaaaaat tttgcctcaa aactggtgag ctgaattttt 9240 
gcagttaaag catcgtgtag tgtttttctt agtccgttac gtaggtagga atctgatgta 9300 
atggttgttg gtattttgtc accattcatt tttatctggt tgttctcaag -ttcggttacg 9360 
agatccattt gtctatctag ttcaacttgg aaaatcaacg tatx:agt<:gg gcggcct-cgc 9420 
ttatcaacca ccaatttcat attgctgtaa gtgtttaaat ctttacttat tggtttcaaa 9480 
acccattggt taagcctttt aaactcatgg tagttatttt caagcattaa catgaactta 9540 
aattcatcaa ggctaatctc tatatttgcc ttgtgagttt tcttttgtgt tagttctttt 9600 
aataaccact cataaatcct catagagtat ttgttttcaa aagacttaac atgttccaga 9660 
ttatatttta tgaatttttt taactggaaa agataaggca atatctcttc actaaaaact 9720 
aattctaatt tttcgcttga gaacttggca tagtttgtcc actggaaaat ctcaaagcct 9780 
ttaaccaaag gattcctgat ttccacagtt -ctcgtcatca gctctctggt tgctttagct 9840 
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aatacaccat 


aagcattttc 


cctactgatg 


ttcatcatct 


gagcgtattg gttataagtg 9900 


aacgataccg 


tccgttcttt 


ccttgtaggg 


ttttcaatcg 


tggggttgag tagtgccaca 


9960 


cagcataaaa 


ttagcttggt 


ttcatgctcc 


gttaagtcat 


agcgactaat 


cgctagttca 


10020 

• 


tttgctttga 


aaacaactaa 


ttcagacata 


catctcaatt 


ggtctaggtg 


attttaatca 


10080 


ctataccaat 


tgagatgggc 


tagtcaatga 


taattactag 


tccttttcct 


ttgagttgtg 


10140 


ggtatctgta 


aattctgcta 


gacctttgct 


ggaaaacttg 


taaattctgc tagaccctct 


10200 


gtaaattccg 


ctagaccttt 


gtgtgttttt 


tttgtttata 


ttcaagtggt 


tataatttat 


10260 


agaataaaga 


aagaataaaa 


aaagataaaa 


agaatagatc 


ccagccctgt 


gtataactca 


10320 


ctactttagt 


cagttccgca 


gtattacaaa 


aggatgtcgc 


aaacgctgtt 


tgctcctcta 


10380 


caaaacagac 


cttaaaaccc 


taaaggctta 


agtagcaccc 


tcgcaagctc 


gggcaaatcg 


104 40 


ctgaatattc 


cttttgtctc 


cgaccatcag 


gcacctgagt 


cgctgtcttt 


ttcgtgacat 


10500 


tcagttcgct 


gcgctcacgg 


ctctggcagt 


gaatgggggt 


aaatggcact 


acaggcgcct 


10560 


tttatggatt 


catgcaagga 


aactacccat 


aatacaagaa 


aagcccgtca 


cgggcttctc 


10620 


agggcgtttt 


atggcgggtc 


tgctatgtgg 


tgctatctga 


ctcttcgctg 


ttcagcagtt 


10680 


cctgccctct 


gattttccag 


tctgaccact 


tcggattatc 


ccgtgacagg 


tcattcagac 


10740 


tggctaatgc 


acccagtaag 


gcagcggtat 


catcaacagg 


cttacccgtc 


ttactgtcaa 


10800 



c 10801 

<210> 77 
<211> 8654 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN236 plasmid 

<400> 77 

ctcgaggcct acctagcttc caagaaagat atcctaacag cacaagagcg gaaagatgtt 60 

ttgttctaca tccagaacaa cctctgctaa aattcctgaa aaattttgca aaaagttgtt 120 

gactttatct acaaggtgtg gtataataat cttaacaaca gcaggacgct ctagaggagg 180 

agacatcatg aaaattggaa ttatcggcgg aggctccgtt ggtcttttat gcgcctatta 240 

tttgtcactt tatcacgacg tgactgttgt gacgaggcgg caagaacagg ctgcggccat 300 

tcagtctgaa ggaatccggc tttataaagg cggggaggaa ttcagggctg attgcagtgc 360 
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ggacacgagt atcaattcgg actttgacct gcttgtcgtg acagtgaagc agcatcagct 420 
t-caatctgtt tttt-cgtcgc ttgaacgaat <:gggaagacg aatatattat ttttgcaaaa 480 
cggcatgggg catatccacg acctaaaaga ctggcacgtt -ggccattcca *t*tatgt4>gg 540 
aatcgttgag <:acggagctg taagaaaajtc ggatacagct gttgatcata caggcctagg 600 
tgcgataaaa tggagcgcgt tcgacgatgc tgaaccagac xggctgaaca t<*tgtttca 660 
gcataaccat tcggattttc cgatt*atta tgagacggat tggtaccgtc tgctgacggg 720 
caagctgatt gtaaatgcgt gtattaatcc tttaactgcg ttattgcaag tgaaaaatgg 780 
agaactgctg acaacgccag cttatctggc ttttat^gaag ctggtattt-c aggaggcatg 840 
ccgcatttta aaacttgaaa atgaagaaaa ggcbtgggag -cgggtt-cagg ccgttt^tgg 900 
gcaaacgaaa gagaatcgtt catcaatgct ggttgacgtc attggaggcc ggcagacgga 960 
agctgacgcc attatcggat acttattgaa ggaagcaagt cttcaaggt*: -ttgatgecgt 1020 
ccacctagag tttttatatg gcagcatcaa agcattggag -cgaaatacaa acaaagtcfct 1O80 
ttgagctttt tcggtaacat gctatactca tttcggatca ctaactattt attggaggat 1140 
cctgttttgg cggatgagag aagattttca gcctgataca ^gattaaa-tca gaacgcagaa 1200 
gcggtctgat aaaacagaat ttgcctggcg gcagtagcgc ggt^gtceca ^.ctgacocca 1260 
tgccgaactc agaagtgaaa cgccgtagcg ccgatggtag tgtggggtct ccccatgcga 1320 
gagtagggaa ctgccaggca t^caaataaaa cgaaaggctc agtogaaaga -ctgggccttt 1380 
cgttttatct gttgtttgtc ggtgaacgct <:tcct^agta ggacaaatcc gccgggagcg 1440 
gatttgaacg ttgcgaagca acggcccgga gggtggcggg caggacgocc gccataaact 1SO0 
gccaggcatc aaattaagca gaaggccatc ctgacggatg gcct^tttgc gtt*<:tacaa 1560 
actcttggta cccagaaaaa gcggcaaaag cggct-gttaa aaaagcgaaa tcgaagaagc 1620 
tgtctgccgc taagacggaa tatcaaaagc gt-tctgctgt tgtgtcatct ttaaaagtca 1-680 
cagccgatga atcccagcaa gatgtoctaa aataottgaa -cacccagaaa -gataaaggaa 1740 
atgcagacca aattcattct tattatgtgg tgaacgggat tgctgttcat gcctcaaaag 1800 
aggttatgga aaaagtggtg cagttt<:ccg aagtggaaaa ggtgct.tcct aatgagaaac 1860 
ggcagctttt taagtcatcc tccccattta atatgaaaaa agcacagaaa gctattaaag 1920 
caactgacgg tgtggaatgg aatgtagacc aaatcgatgc -cccaaaagot *gggcac,ttg 1980 
gatatgatgg aactggcacg gttgttgcgt ccattgatac -cggggtggaa tggaatcatc 2040 
cggcattaaa agagaaatat -cgcggatata atccggaaaa tcctaatgag cctgaaaa^g 21O0 
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add tyadCLy 


gtatgatgcc 


yddCCCaCgi 


gacaggcacg 


t agcacctgg 


cgcaaaatgg 


ctgacatttt 


ggaagctggt 


acccggaaat 


ggctcctgat 


aatggtacag 


agacatggtc 


cggggaatac 


ggatctcttt 


atccagaatc 


gtttgcaact 


aaatcgctaa 


tgttgattac 


gagtaaaaga 


ttgtgctgaa 


agttgtatcg 


agtgtggttt 


caatgaaaca 


tggcattcag 


cagttcgttg 


gttgtttctc 


ataagagttt 


gtcagatatg 


ataaaaatct 


tgttggtttt 


cttataatca 


gcacatgcat 


aaaactacgt 


gaatcaaaaa 


atgatccaaa 


tgtaaaagtt 


aatcggcaat 


tgacgaaact 


atgaagaaaa 


gaatttgaaa 


taatctccta 


tggtggtttg 


aagaaggcga 


tttgattcat 


ttat tgcaat 


gtggaattgg 


gggccatatt 


gttgtataag 


aaatgtttta 


atcaagttta 


caattaaaag 


agggaagcgt 


gggtgaaatg 


tgaatacatc 


tggctttgca 


ttttatcttt 


cctgatattg 


caaatgattt 
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gtagcaggcg 


aggcaagccc 


atggtgggct 


ctgaacctga 


attgctgtta 


aagcgttctc 


gaatgggttt 


tagcaccaaa 


gttgtcaata 


actcatqqqq 


aatgcctggc 


qttcqqccqa 


attcccaacG 


oacctoattc 


qqaqcqacta 


agaattccaa 


tttgaacttc 


tgcatattct 


atattagagt 


ataaacaaaa 


tgtaaatcca 


ggctttgtcc 


tcacaaaagg 


ttat tactaa 


acattaacag 


ttaaaaatat 


gctcaaggat 


ttcaccoaat 

^ W r w U C4 U V* 


atgcgtgcaa 


caaaaataar 

\j u u ^ l» a 


gtattggtat 


atataaaacc 


caatggattc 


aat tttaaaa 

v_- \~ v_ \j \_j c-i a 


caaatgattc 


^v^u a a a a q 


gcaaaatatc 


ctgtaaagga 


cgtttgtctg 


atttoaaoaa 


ttaaaagaaa 


tacataaaaa 


acagatgatg 


acgaaaaagc 


gaacggaaaa 


attattttat 


tgatgaaata 


ctgaatttaa 


ggaggaatta 


attatgaagt 


atcattaacc 


ctataaacta 


ctattcacaa 


tcgaatttac 


ttttagcgta 


ttaaatgaaa 


taataaacca 


cctgcgagta 
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ttatgatgat 


ttggctcatg 


2160 


tgtjaacaaat 


caaatcggtg 


2220 


tgaagatggc 


ggcactgatg 


2280 


ggacgcggaa 


ggaaatcccc 


2340 


agggggctct 


( ggacttgatg 


2400 


tattttccct 


gagttttcag 


2460 


tatcgcaaat 


ccggcaaact 


2520 


ttccccatgg 


agagaaaaga 


2J580 


tgaatttaaa 


aaggctgaaa 


2640 


tcgtgaaaca 


ggcgaaagaa 


2700 


aatgtgcaac 


tggaggagag 


2760 


i 

agttattaaa 


caaaagccaa 


2820 


ttatgatggc 


gaagaattaa 


2880 


gatgcaatat 


aaaaaaatta 


2940 


aataaataat 


aaagataatt 


3000 


aacttatttt 


aagaatacag 


3060 


aaaggcaatg 


aaattagact 


3120 


taaatataaa 


tcggatatac 


3180 


tacggatttt 


atgaccgatg 


3240 


aggtttacac 


cgtaaaaggt 


3300 


attaaacctt 


gatgacacag 


3360 


cgatgaagat 


ggattttcta 


3420 


taaagagtag 


ttcaacaaac 


3480 


aacttagttt 


atatgtggta 


3540 


gtaatgaatg 


taacagggtt 


3600 


cgtctgccct 


cattattgga 


3660 


gacacaacca 


aattttaatt 


3720 


tggttttgaa 


cgtctcatta 


3780 


caaactgggt 


gaacacagcc 


3840 
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tttatgttaa ccttttccat -tggaacagct 
atcaaaaggt tactcctatt tggaattata 
gttggccatt ctttcttttc -cttacttatt 
gctgcatttc cagcactcgt aatggttgta 
ggtaaagcat ttggtcttat tggatcgata 
attggtggaa tgatagccca ttatattcat 
acaattatca ctgttccgtt tcttatgaaa 
cattttgata tcaaaggaat tatactaatg 
acaacatcat atagcatttc ttttcttatc 
aaacatatca ggaaagtaac agatcctttt 
tttatgattg gagttctttg tgggggaatt 
atggttcctt atatgatgaa agatgttcac 
attattttcc ctggaacaat gagtgtcatt 
gatagaagag gtcctttata cgtgttaaac 
ttaactgctt cctttctttt agaaacaaca 
gttttaggtg ggctttcgct caccaaaaca 
aaacagcagg aagctggtgc tggaatgagt 
ggaacaggta ttgcaattgt aggtggttta 
ttacctatgg aagttgatca gtcaacttat 
ggaatcattg tcattagttg gctggttacc 
ttctaaatcg ttaagggatc aactttggga 
cagttcgaag cggccgcaat tcttgaagac 
aggttaatgt catgataata atggtttctt 
tgcgcggaac ccctatttgt ttatttttct 
gacaataacc ctgataaatg cttcaataat 
atttccgtgt cgcccttatt cccttttttg 
cagaaacgct ggtgaaagta aaagatgctg 
tcgaactgga tctcaacagc ggtaagatcc 
caatgatgag cacttttaaa ^gttctgctat 
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gtatat-ggaa agctatctga t<:aattaggc 3900 
ataaattgtt t^cgggtcggt aattgg^ttt 3960 
atggct^gtt ttattcaagg ggctggtgca 4020 
gttgcgcgct atattccaaa ggaaaatagg 4080 
^gtagccatgg gagaaggagt -cggtccagcg 4140 
iiggtcctatc ttctactcat tcctatgata 4200 
ttattaaaga aagaagtaag gataaaaggt 4260 
tctgtaggca i:t<gtat*ttt tatgttgttt 4 320 
gttagcgtgc 4^tcat*cct gatat*t<gta 4380 
gttgatcccg gattagggaa aaatatacct 4 4 40 
atatttggaa *cagtagcagg gtttgtctct 4500 
cagctaagta ctgccgaaat -cggaagtgta 4560 
attttcggct acattggtgg gatacttgtt 4*620 
atcggagtta catttct-t-tc tgttagctt* 4-680 
tcatggttca tgacaattat aatcgtattt 4740 
gttatat<:aa «zaat*>gtttc aagtagcttg 4800 
ttgcttaact *<taccagctt tttatcagag <860 
ttatccatac ccttacttga t^caaaggttg 4 920 
ctgtatagta atttgttatt act^ttttca 4980 
tt^aatgtat ataaacattc tcaaag^gat 5040 
gagagttcaa aattgatcct ttttttataa 5100 
gaaagggcct cgtgatac^c ctatttttat 5160 
agacgt-cagg tggcactttt cggggaaatg 5220 
aaatacattc aaatatgtat ccgctcatga 5280 
at-tgaaaaag -gaagagtatg agtatt-caac 5340 
x:ggcattttg -ccttcctgtt tttgct-cacc 5400 
aagatcagtt gggtgcacga gtgggttaca 5460 
ttgagagttt tcgccccgaa gaacgttttc 5520 
gtggcgcggt at-tatcccgt attgacgc<:g 5580 
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ggcaagagca 


actcggtcgc 


cgcatacact 


at tctcagaa 


tgacttggtt gagtactcac 




cagtcacaga 


aaagcatctt 


acaaataaca 


taacaotaao 


agaattatgc 


agtgctgcca 


D /UU 


taaccatgag 


tgataacact 


gcggccaact 


tacttctciar 


aacgategga 


ggaccgaagg 


D fX>\J 


agctaaccgc 


ttttttgcac 


aacatggggg 


atcatataar 


tegecttgat cgttgggaac 




cggagctqaa 

777 _ 


tgaagccata 


ccaaacoaca 
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cacgatgcct 


gcagcaatgg 


coon 


caacaacgtt 


gcgcaaacta 


ttaactaaca 


aarf aft f ar 


tctagcttcc 


eggcaacaat 




taatagactg 


era t a aaaaca 

7 ** y y M y y y 


y a uaaay u _y 


Layy dLLdCL 


tctgcgctcg gcccttccgg 


6000 


ctqqctaqtt 


tattactaat 

w u w w wj w VMu L 


o. a a i_ y y a y 


l _yy Lydgcg 


tgggtctege 


ggtatcattg 


6060 


caacactaaa 


occaaataot 

^ w. a wj 0 w VJ w 


P. rr r» r* /*• t* rr r* 
a ay wv_ l_ WV— 


yLdtcytayC 


tatctacacg 


aeggggagtc 


6120 


aggcaactat 


aoataaacoa 


aauayaLaya 


i .yocydydL 


aggtgectea ctgattaagc 


6180 


attggtaact 


atcaaaccaa 

*— W U « d V— ' V* CI CI 


at" t"t"A^t*f^^t" 


atd.a.LLia 


gattgattta 


aaacttcatt 


6240 


tttaatttaa 

www v-* w* www w\ Wl 


aaanatptan 


y LyadvjdtCC 


tttLtyataa 


< 

tctcatgacc aaaatccctt 


6300 


aacgtgagt t 


t"t _ r'at*t~r , _~ , -"i."' 


Lyay ,y ■_ way 


dccccgtaga 


aaagatcaaa 


ggatcttctt 


6360 


gagatcct tt 


1 1 ttctacar 


rr f- a o t- +- rrr* h 
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y 1 tytaddC 


aaaaaaacca 


ccgctaccag 


6420 


cactaatt ta 


tttaccaaa t* 
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tccgaaggta 


actggcttca 


6480 


qcaqaqcaca 


gataccaaat 


ar>f fit* rrttr 

a V- uu w LLC 


lay Lyidy v_v- 


gtagttaggc 


caccacttca 


6540 


agaactctgt 




acata _LLLU 


c ugccaau 


cctyttacca 


gtggctgctg 


6600 


ccaataacaa 


tsao t cert" at" 
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wu Lactyyy u 


LyydtlCddy 


acgatagtta 


ceggataagg 


6660 


cacaqcaqtc 


aaactaaaca 


yyyyy <- 


npa^apanrp 
y -dudLdyct 


cagcttggag 


cgaacgacct 


6720 


acaccgaact 


oaaataccta 




tatydydddy 


cgccacgctt 


cccgaaggga 


6780 


gaaaggcgga 


caggtatccg 


gtaageggea 


yyy v- ^- y y a a v. 


aggagagege 


acgagggagc 
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ttccagggqq 


aaacgcctgg 


tatctttata 


at>cct" otrnfl 
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gtttcgccac 


ctctgacttg 


oyuu 


agcgtcgatt 


tttgtgatgc 


teat caaaac 


^ ^ ^ _i y y "~ 


atggaaaaac 


gccagcaacg 


t>y ou 


cggccttttt 


acggttcctg 


gecttttget 


ggccttttgc 


tcacatgttc tttcctgcgt 


1(\0C\ 
/u_u 


tatcccctga 


tcctgtggat 


aacegtatta 


ccgcctttga 


gtgagctgat 


accgctcgcc 


7080 


gcagccgaac 


gaccgagcgc 


agegagtcag 


tgagcgagga 


ageggaagag cgcctgatgc 


7140 


ggtattttct 


ccttacgcat 


ctgtgcggta 


tttcacaccg 


catatggtgc actctcagta 


7200 


caatctgctc 


tgatgecgea 


tagttaagee 


agtatacact 


ccgctatcgc 


tacgtgactg 


7260 


ggtcatggct 


gcgccccgac 


acccgccaac 


acccgctgac 


gcgccctgac 


gggcttgtct 


7320 
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gctcccggca tccgcttaca gacaagctgt gaccgtctcc ^ggagctgca tgtgtcagag 7380 
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagotcat cagcgtggtc 7440 
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 7500 
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcc*o;«t 75*60 
ggtcacttga tgcctccgtg taagggggaa tttctgtt<:a tgggggtaat gataecgatg 7 620 
aaacgagaga ggatgct<*c gatacgggtt actgatgatg aacatgcccg gttactggaa 7680 
cgttgtgagg gtaaacaact ggcggtatgg atgcggcggg accagagaaa aatcactcag 77 40 
ggtcaatgcc agcgcttcgt taatacagat gtaggtgttc cacagggtag ccagcagcat 7800 
cctgcgatgc agatccggaa cataatggtg cagggcgctg acttccgcgt ttccagactt 7860 
tacgaaacac ggaaaccgaa gaccattcat gttgttgctc aggtcgcaga cgttttgcag 7 920 
cagcagtcgc ttcacgttcg ctcgcgtatc ggtgattcat <ctgctaacc agtaaggcaa 7980 
ccccgccagc ctagccgggt cctcaacgac aggagcacga tcatgcgcac ccgtggccag 3040 
gacccaacgc tgcccgagat gcgccgcgtg cggctgotgg agatggcgga ^cgcgat^gat 8100 
atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaat.tg at-tggcfrcca 3160 
atr.cttggag tggtgaatcc gttagcgagg tgccgccggc ttccat-tcag gtcgaggt^g 8220 
cccggctcca tgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg gcggcgccta 3280 
caatccatgc caacccgttc catgtgctcg -ccgaggcggc ataaatcgcc gtgacga4:ca 3 340 
gcggtccagt gatcgaagtt aggctggtaa gagccgcgag <:gatcctt^a agctgtccct 8400 
gatggtcgtc atctacctgc .ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc 84*60 
cggaagcgag aagaatcata atggggaagg >ccatccagcc tcgcgtcggc ggccgcttcg 8520 
tcgaccgaaa cagcagttat aaggcatgaa gctgtccggt ttttgcaaaa gt^gctqtga 8580 
ctgtaaaaag aaatcgaaaa agaccgtttt gtgtgaaaac ggtctttt-tg ttt<:ctttta 8640 
accaactgcc ataa 6654 

<210> 78 
<211> 8093 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN4 23 plasmid 

<4 00> 78 

gqcggccgct tcgtcgaccg aaacagcagt tataaggcat gaagctgt<:c ggtttttgca 60 
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aaagtggctg 


tgactgtaaa aagaaatcga 


aaaagaccgt 


tttgtgtgaa 


aacggtcttt 


120 


ttgtttcctt 


ttaaccaact gccataactc gaggcctacc 


tagcttccaa 


gaaagatatc 


180 


ctaacagcac 


aagagcggaa agatgttttg ttctacatcc 


agaacaacct 


ctgctaaaat 


240 


tcctgaaaaa 


ttttgcaaaa agttgttgac 


tttatctaca 


aggtgtggta 


taataatctt 


300 


aacaacagca 


ggacgctcta gaaaaggagg 


aatttaaatg 


tatcgtacga 


'tgatgagcgg 


360 


caaacttcac 


agggcaactg ttacggaagc 


aaacctgaac 


tatgtgggaa 


gcattacaat 


420 


tgatgaagat 


ctcattgatg ctgtgggaat 


gcttcctaat 


gaaaaagtac 


aaattgtgaa 


480 


taataataat 


ggagcacgtc ttgaaacgta 


tattattcct 


ggtaaacggg gaagcggcgt 


540 


catatgctta 


aacggtgcaq ccgcacgcct 


tgtgcaggaa 


ggagataagg 


tcattattat 


600 


ttcctacaaa 


atgatgtctg atcaagaagc ggcaagccat 


gagccgaaag 


tggctgttct 


€60 


gaatgatcaa 


aacaaaattg aacaaatgct 


ggggaacgaa 


ccagqccgta 


caattttgta 


720 


aaggatcctg 


ttttggcgga tgagagaaga 


ttttcagcct 


gatacagatt 


aaatcagaac 


780 


gcagaagcgg 


tctgataaaa cagaatttgc 


ctggcggcag 


tagcgcggtg 


gt.cccacctg 


840 


accccatgcc 


gaactcagaa gtgaaacgcc 


gtagcgccga 


tggtagtgtg 


gggtctcccc 


900 


atgcgagagt 


agggaactgc caggcatcaa 


ataaaacgaa 


aggctcagtc gaaagactgg 


960 


gcctttcgtt 


ttatctgttg tttgtcggtg 


aacgctctcc 


tgagtaggac 


aaatccgccg 


1020 


ggagcggatt 


tgaacgttgc gaagcaacgg 


cccggagggt 


ggcgggcagg 


acgcccgcca 


1080 


taaactgcca 


ggcatcaaat taagcagaag 


gccatcctga 


cggatggcct 


ttttgcgttt 


1140 


ctacaaactc 


ttggtaccca gaaaaagcgg 


caaaagcggc 


tgttaaaaaa 


gcgaaatcga 


1200 


agaagctgtc 


tgccgctaag acggaatatc aaaagcgttc 


tgctgttgtg 


tcatctttaa 


1260 


aagtcacagc 


cgatgaatcc cagcaagatg 


tcctaaaata 


cttgaacacc 


cagaaagata 


1320 


aaggaaatgc 


agaccaaatt cattcttatt 


atgtggtgaa 


cgggattgct 


gttcatgcct 


1380 


caaaagaggt 


tatggaaaaa gtggtgcagt 


ttcccgaagt 


ggaaaaggtg 


cttcctaatg 


1440 


agaaacggca 


gctttttaag tcatcctccc 


catttaatat 


gaaaaaagca 


cagaaagcta 


.1500 


ttaaagcaac 


tgacggtgtg gaatggaatg 


tagaccaaat 


cgatgcccca 


aaagcttggg 


1560 


cacttggata 


tgatggaact ggcacggttg 


ttgcgtccat 


tgataccggg gtggaatgga 


1620 


atcatccggc 


attaaaagag aaatatcgcg gatataatcc 


ggaaaatcct 


aatgagcctg 


1680 


aaaatgaaat 


gaactggtat gatgccgtag caggcgaggc 


aagcccttat gatgatttgg 


1740 


ctcatggaac 


ccacgtgaca ggcacgatgg 


tgggctctga 


acctgatgga 


acaaatcaaa 


1800 
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tcggtgtagc acctggcgca aaatggattg ctgttaaagc gtfcctctgaa gatggcggca L860 
ctgatgctga -cattttggaa gctggtgaat gggttfctagc aocaaaggac gcggaaggaa 1920 
atccccaccc ggaaatggct cctgat-gtt^ <caataactc atggggaggg ^gctctggac 1980 
ttgatgaatg gtacagagac atggtcaatg cctggcgttc ggccgatat* tt«:ctgagt 2040 
tttcagcggg gaatacggat ctctttattc -ccggcgggcc tggttctatc gcaaatccgg 21O0 
caaactatcc agaatcgttt gcaactggag -cgactgagaa ttccaattcc ccatggagag 21'60 
aaaagaaaat cgctaatgtt gattactttg aacttctgca tattcttgaa tttaaaaagg 2220 
ctgaaagagt aaaagattgt gctgaaatat tagagtataa acaaaafccgt gaaacaggcg 2280 
aaagaaagtt gtatcgagtg tggttttgta aatccaggct ttgtccaatg -tgcaactgga 2340 
ggagagcaat gaaacatggc attcagtcac aaaaggttgt tgctgaagtt attaaacaaa 2400 
agccaacagt tcgttggttg tt-totcacat taacagttaa aaafcgfcttat gat^ggcgaag 24*60 
aattaaataa gagtttgtca gatatggctc aaggatttcg ccgaatgatg caatataaaa 2520 
aaattaataa aaatcttgtt ggttttatgc gtgcaacgga agtgacaata aataataaag 25S0 
ataattctta taatcagcac atgcatgtat tggtatgtgt ggaaccaact tattttaaga 2€40 
atacagaaaa ctacgtgaat caaaaacaat ggattcaatt <tggaaaaag gcaatgaaat 2700 
tagactatga tccaaatgta aaagttcaaa tgattcgacc gaaaaataaa tataaat-cgg 2760 
atatacaatc ggcaattgac gaaact^gcaa aatatcctgt aaaggatacg gattttatga -2820 
ccgatgatga agaaaagaat tfcgaaacgt* tgtctgattt ggaggaaggt *tacaocgta 2860 
aaaggttaat ctcctatggt ggtttgttaa aagaaataca taaaaaatta aaecttgatg 2940 
acacagaaga aggcgatttg attcatacag atxgatgacga aaaagccgat gaaga^ggat 3000 
tttctattat tgcaatgtgg aattgggaac ggaaaaatta ttttafctaaa gagta^ttca 3060 
acaaacgggc catattgttg tataagtgat gaaatactga atttaaaact tagtttatat 3120 
gtggtaaaat gttttaatca agtttaggag gaattaatta tgaagtgtaa tgaat^gtaac 3180 
agggttcaat taaaagaggg aagcgtatca ttaaccctat aaactacgt<: tgccctcatt 3240 
attggagggt gaaatgtgaa tacatcctat tcacaatcga atttacgaca caaccaaatt 3300 
ttaatttggc tttgcatttt atcttttttt agcgtattaa atgaaatggt tttgaacgtc 3360 
tcattacctg atattgcaaa tgattttaat aaaccacctg cgagtacaaa ctgggtgaac 3420 
acagccttta tgttaacctt ttccattgga acagctgtat atggaaagct atctgatcaa 3480 
ttaggcatca aaaggttact cctatttgga at-tataataa attgtttcgg gtcggtaatt 3540 
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gggtttgttg 


gccattcttt 


cttttcctta 


cttattatgg 


ctcgttttat 


tcaaggggct 


3600 


ggtgcagctg 


catttccagc 


actcgtaatg 


gttgtagttg 


cgcgctatat 


tccaaaggaa 


3660 


aataggggta 


aagcatttgg 


tcttattgga 


tcgatagtag 


ccatgggaga 


aggagtcggt 


3720 


ccagcgattg 


gtggaatgat 


agcccattat 


attcattggt 


cctatcttct 


actcattcct 


3780 


atgataacaa 


ttatcactgt 


tccgtttctt 


atgaaattat 


taaagaaaga 'agtaaggata 


3840 


aaaggtcatt 


ttgatatcaa 


aggaattata 


ctaatgtctg 


taggcattgt 


attttttatg 


3900 


ttgtttacaa 


catcatatag 


catttctttt 


cttatcgtta 


gcgtgctgtc 


attcctgata 


3960 


tttgtaaaac 


atatcaggaa 


agtaacagat 


ccttttgttg 


atcccggatt 


agggaaaaat 


4020 


atacctttta 


tgattggagt 


tctttgtggg 


ggaattatat 


ttggaacagt 


agcagggttt 


4080 


gtctctatgg 


ttccttatat 


gatgaaagat 


gttcaccagc 


taagtactgc cgaaatcgga 


4140 


agtgtaatta 


ttttccctgg 


aacaatgagt 


gtcattattt 


tcggctacat 


tggtgggata 


4200 


cttgttgata 


gaagaggtcc 


tttatacgtg 


ttaaacatcg 


gagttacatt 


tctttctgtt 


4260 


agctttttaa 


ctgcttcctt 


tcttttagaa 


acaacatcat 


ggttcatgac 


aattataatc 


4320 


gtatttgttt 


taggtgggct 


ttcgttcacc 


aaaacagtta 


tatcaacaat 


tgtttcaagt 


4380 


agcttgaaac 


agcaggaagc 


tggtgctgga 


atgagtttgc 


ttaactttac 


cagcttttta 


4440 


tcagagggaa 


caggtattgc 


aattgtaggt 


ggtttattat 


ccataccctt 


acttgatcaa 


4500 


aggttgttac 


ctatggaagt 


tgatcagtca 


acttatctgt 


atagtaattt 


gttattac-t 


4560 


ttttcaggaa 


tcattgtcat 


tagttggctg 


gttaccttga 


atgtatataa 


acattctcaa 


4620 


agggatttct 


aaatcgttaa 


gggatcaact 


ttgggagaga 


gttcaaaatt 


gatccttttt 


4 680 


ttataacagt 


tcgaagcggc 


cgcaattctt 


gaagacgaaa 


gggcctcgtg 


atacgcctat 


4740 


ttttataggt 


taatgtcatg 


ataataatgg 


tttcttagac 


gtcaggtggc actttt-cggg 


4800 


gaaatgtgcg 


cggaacccct 


atttgtttat 


ttttctaaat 


acattcaaat 


atgtatccgc 


4860 


tcatgagaca 


ataaccctga 


taaatgcttc 


aataatattg 


aaaaaggaag agtatgagta 


4 920 


ttcaacattt 


ccgtgtcgcc 


cttattccct 


tttttgcggc 


attttgcctt 


cctgtttttg 


4980 


ctcacccaga 


aacgctggtg 


aaagtaaaag 


atgctgaaga 


tcagttgggt 


gcacgagtgg 


5040 


gttacatcga 


actggatctc 


aacagcggta 


agatccttga 


gagttttcgc cccgaagaac 


5100 


gttttccaat 


gatgagcact 


tttaaagttc 


tgctatgtgg 


cgcggtatta 


tcccgtattg 


5160 


acgccgggca 


agagcaactc 


ggtcgccgca 


tacactattc 


tcagaatgac ttggttgagt 


5220 


actcaccagt 


cacagaaaag 


catcttacgg 


atggcatgac 


agtaagagaa 


ttatgcagtg 


5280 
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ctgccataac catqagtgat aacactgcgg ccaacttact totgacaacg atcggaggac 5340 
cgaaggagct aaccgctttt ttgcacaaca *gggcfgat<:a tgtaact«|c x:ttgat<:gtt 5400 



caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 5520 
aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 5S80 
ttccggctgg ctggtttatt gctgataaat -ctggagccgg tgagcgtggg fcctcgcggta 5640 
tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 5700 
ggagtcaggc aactatggat gaacgaaata gacaga-tcgc tgagataggt gcctcactga 57^60 
ttaagcattg gtaactgtca gaccaagttt actcatatat a-ctttagatt gatttaaaac 15820 



ttcattttta atttaaaagg atctaggtga agatcctttt -tgataatctc atgaccaaaa 5880 
tcccttaacg tgagttttcg ttccactgag -cgtcagaccc <:gtagaaaag atcaaaggat 5940 
cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc *6000 
taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 6060 
gcttcagcag agcgcagata ccaaatactg bccttcta-gt gtagccgtag ttaggccacc 6120 
actrcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg -6180 
ctgctgccag tggcgataag tcgtgtctta ccgggt«t<jga ctcaagacga tagttaccgg -6240 
ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa -6300 
cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcecg 6360 
aagggagaaa ggcggacagg tatccggtaa gcggcagggt .cggaacagga gagcgcacga -6420 
gggagcttcc agggggaaac gcct^ggtatc tttatagtcc tgtcgggttt cgccacctct -6480 
gacttgagcg tcgatttttg tgatgctcgt caggggggcg -gagcctatgg aaaaacgcca 6540 
gcaacgcggc ctttttacgg ttcctggcct tttgctggcc tttt-gctcac atgttctt^tc ^600 
ctgcgttatc ccctgattct gtggataacc -gtattaccgc -ctttgagtga *gctgataccg 66*60 
ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc -6720 
tgatgcggta ttttctcctt acgcatctgt ^cggtatttc acaccgcata tggtgcactc 67-80 
tcagtacaat ctgctctgat gccgcatagt taagc<:agta tacactccgc tatcgctacg -6840 
tgactgggtc atggctgcgc cccgacaccc -gccaacaccc gctgacgcgc cctgacgggc 6900 
ttgtctgctc ccggcatccg cttacagaca a<gctgtgacc gtctccggga -gctgcatgtg *6960 
tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag ctgcggtaaa *gct ca t cage 7020 



gggaaccgga gctgaatgaa gccataccaa aegacgageg 



tgacaccacg atgcctgcag 5460 
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gtggtcgtga 


agcgattcac 


agatgtctgc 


ctgttcatcc 


gcgtccagct cgttgagttt 


7080 


ctccagaagc 


gttaatgtct 


ggcttctgat 


aaagcgggcc 


atgttaaggg cggttttttc 


7140 


ctgtttggtc 


acttgatgcc 


tccgtgtaag 


ggggaatttc 


tgttcatggg ggtaatgata 


7200 


ccgatgaaac 


gagagaggat 


gctcacgata 


cgggttactg 


atgatgaaca tgcccggtta 


7260 


ctggaacgtt 


gtgagggtaa 


acaactggcg 


gtatggatgc 


ggcgggacca gagaaaaatc 


7320 


acteagggtc 


aatgccagcg 


cttcgttaat 


acagatgtag 


gtgttccaca gggtagccag 


7380 


cagcatcctg 


cgatgcagat 


ccggaacata 


atggtgcagg 


gcgctgactt ccgcgtttcc 


74 40 


agactttacg 


aaacacggaa 


accgaagacc 


attcatgttg 


ttgctcaggt cgcagacgtt 


lioo 


ttgcagcagc 


agtcgcttca 


cgttcgctcg 


cgtatcggtg 


attcattctg ctaaccagta 


7560 


aggcaacccc 


gccagcctag 


ccgggtcctc 


aacgacagga 


gcacgatcat gcgcacccgt 


7620 


ggccaggacc 


caacgctgcc 


cgagatgcgc 


cgcgtgcggc 


tgctgpagat ggcggacgcg 


7680 


atggatatgt 


tctgccaagg 


gttggtttgc 


gcattcacag 


ttctccgcaa gaattgattg 


7740 


gctccaattc 


ttggagtggt 


gaatccgtta 


gcgaggtgcc 


gccggcttcc attcaggtcg 


7800 


aggtggcccg 


gctccatgca 


ccgcgacgca 


acgcggggag 


gcagacaagg tatagggcgg 


7860 


cgcctacaat 


ccatgccaac 


ccgttccatg 


tgctcgccga 


ggcggcataa atcgccgtga 


7920 


cgatcagcgg 


tccagtgatc 


gaagttaggc 


tggtaagagc 


cgcgagcgat ccttgaagct 


7980 


gtccctgatg 


gtcgtcatct 


acctgcctgg 


acagcatggc 


ctgcaacgcg ggcatc<:cga 


804 0 


tgccgccgga 


agcgagaaga 


atcataatgg 


ggaaggccat 


ccagcctcgc gtc 


8093 



<210> 79 
<211> 8098 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN429 plasmid 

<400> 79 

ggcggccgct tcgtcgaccg aaacagcagt tataaggcat gaagctgtcc ggtttttgca 60 

aaagtggctg tgactgtaaa aagaaatcga aaaagaccgt tttgtgtgaa aacggtcttt 120 

ttgtttcctt ttaaccaact gccataactc gaggcctacc tagcttccaa gaaagatatc 180 

ctaacagcac aagagcggaa agatgttttg ttctacatcc agaacaacct ctgctaaaat 240 

tcctgaaaaa ttttgcaaaa agttgttgac tttatctaca aggtgtggta taataatctt 300 
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aacaacagca ggacgctcta gattagaaag gaggtttaat taatgtatcg tacgatgatg 360 
agcggcaaac ttcacagggc aactgttacg gaagcaaacc tgaactat^t gggaagcatt 420 
acaattgatg aagatctcat tgatgctgtg ggaatgcttc ctaatgaaaa agtacaaatt 480 
gtgaataata ataatggagc acgtcttgaa acgtatatta ttcctggtaa acggggaagc 540 
ggcgtcatat gcttaaacgg tgcagccgca cgccttgtgc aggaaggaga taaggt^att €00 
attatttcct acaaaatgat gtctgatcaa gaagcggcaa gccatgagcc gaaagtggct 660 
gttctgaatg atcaaaacaa aattgaacaa atgctgggga acgaaccagc -ccgtacaatt 720 
ttgtaaagga tcctgttttg gcggatgaga gaagattttc agcctgatac agattaaat-c 780 
• agaacgcaga agcggtctga taaaacagaa tttgcctggc ggcagtagcg -cggtggtccc 840 
acctgacccc atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc 900 
tccccatgcg agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag 960 
actgggcctt tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc 1020 
cgccgggagc ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc 1O80 
cgccataaac tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg 1140 
cgtttctaca aactcttggt acccagaaaa agcggcaaaa gcggctgtta aaaaagcgaa 1200 
atcgaagaag ctgtctgccg ctaagacgga atatcaaaag -cgttctgctg ntgtgtcatc 1260 
tttaaaagtc acagccgatg aatcccagca agatgtccta aaatacttga acacccagaa 1320 
agataaagga aatgcagacc aaattcattc ttattatgtg gtgaacggga ttgctgttca 1380 
tgcctcaaaa gaggttatgg aaaaagtggt gcagtttccc gaagtggaaa aggtgcttcc 1440 
taatgagaaa cggcagcttt ttaagtcatc ctccccattt aatatgaaaa aagcacagaa 1500 
agctattaaa gcaactgacg gtgtggaatg gaatgtagac caaatcgatg ccccaaaagc 1560 
' ttgggcactt ggatatgatg gaactggcac ggttgttgcg tccattgata ccggggtgga 1620 
atggaatcat ccggcattaa aagagaaata tcgcggatat aatccggaaa atcctaatga 1680 
gcctgaaaat gaaatgaact ggtatgatgc cgtagcaggc gaggcaagcc cttatgatga 1740 
tttggctcat ggaacccacg tgacaggcac gatggtgggc tctgaacctg atggaacaaa 1800 
tcaaatcggt gtagcacctg gcgcaaaatg gattgctgtt aaagcgttct ctgaagatgg 1860 
cggcactgat gctgacattt tggaagctgg tgaatgggtt ttagcaccaa aggacgcgga 1920 
aggaaatccc cacocggaaa tggctcctga tgttgtcaat aactcatggg gagggggctc 1980 
tggacttgat gaatggtaca gagacatggt caatgcctgg cgttcggccg atatttt-coc 5040 
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taaacacaac 


ctttatatta 


atcaattagg 


catcaaaagg 


taattgggtt 


tgttggccat 


gggctggtgc 


agetgeattt 


aggaaaatag 


gggtaaagca 


tcggtccagc 


gattggtgga 
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eggatctett tattcccggc 
cgtttgcaac tggagegact 
atgttgatta ctttgaactt 
attgtgctga aatattagag 
gagtgtggtt ttgtaaatcc 
atggcattca gtcacaaaag 
ggttgtttct cacattaaca 
tgtcagatat ggctcaagga 
ttgttggttt tatgegtgea 
ageacatgea tgtattggta 
tgaatcaaaa acaatggatt 
atgtaaaagt tcaaatgatt 
ttgacgaaac tgcaaaatat 
agaatttgaa acgtttgtct 
atggtggttt gttaaaagaa 
atttgattca tacagatgat 
tgtggaattg ggaacggaaa 
tgttgtataa gtgatgaaat 
aatcaagttt aggaggaatt 
gagggaagcg tatcattaac 
gtgaatacat cctattcaca 
attttatctt tttttagcgt 
gcaaatgatt ttaataaacc 
accttttcca ttggaacagc 
ttactcctat ttggaattat 
tctttctttt ccttacttat 
ccagcactcg taatggttgt 
tttggtctta ttggatcgat 
atgatagece attatattca 
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yyyuctggtX 


etategcaaa 


21vU 


yayaaLLCCa 


attccccatg 


2160 


t/tgCdtdttC 


ttgaatttaa 


2220 


tacaaacaaa 


ategtgaaac 


2280 


aggctttgtc 


caatgtgcaa 


2340 


gttgttgctg 


aagttattaa 


2400 


gttaaaaatg 


tttatgatgg 


2460 


tttcgccgaa 


tgatgeaata 


2520 


acggaagtga 


caataaataa 


2580 


tgtgtggaac 


caacttattt 


2640 


caattttgga 


aaaaggcaat 


2700 


cgaccgaaaa 


ataaatataa 


2760 


cctgtaaagg 


ataeggattt 


2820 


gatttggagg 


aaggtttaca 


2880 


atacataaaa 


aattaaacct 


2940 


gacgaaaaag 


ccgatgaaga 


3000 


aattatttta 


ttaaagagta 


3060 


actgaattta 


aaacttagtt 


3120 


aattatgaag 


tgtaatgaat 


3180 


cctataaact 


acgtctgccc 


3240 


atcgaattta 


cgacacaacc 


3300 


aLLdddLyaa 


atggttttga 


3360 


accugegagt 


acaaactggg 


3420 


t,y idtdcgga 


aagctatctg 




aataaattgt 


ttegggtegg 


3540 


tatggctcgt 


tttattcaag 


3600 


agttgcgcgc 


tatattccaa 


3660 


agtagccatg 


ggagaaggag 


3720 


ttggtcctat 


cttctactca 


3780 
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ttcctatgat aacaattatc actgttccgt ttcttat-gaa attattaaag aaagaagtaa 3840 
ggataaaagg tcattttgat atcaaaggaa ttatactaat -gtctgtaggc attgtatttt 3900 
ttatgttgtt tacaacatca tatagcattt cttttcttat <:gttagcgtg ct^tcattcc 3960 
tgatatttgt aaaacatatc aggaaagtaa -cagatccttt tgttgatccc ggattaggga 4020 
aaaatatacc ttttatgatt ggagttcttt gtgggggaat tatatttgga acagtagcag 4080 . 
ggtttgtctc tatggttcct tatatgatga aagatgttca ccagctaagt actgccgaaa 4140 
tcggaagtgt aattattttc cctggaacaa tgagtgtcat tattttcggc tacattggtg 4200 
ggatacttgt tgatagaaga ggtcctttat acgtgttaaa catcggagtt acatttcttt 4260 
ctgttagctt tttaactgct tcctttcttt tagaaacaac atcatggttc atgacaatta 4320 
taatcgtatt tgttttaggt gggctttcgt tcaccaaaac agttatatca acaatt-gttt 4380 
caagtagctt gaaacagcag gaagctggtg ctggaatgag tttgcttaac ttt'accagct 4440 
ttttatcaga gggaacaggt attgcaattg taggtggttt attatccata -cccttactt^ 4500 
atcaaaggtt gttacctatg gaagttgatc agt-caactta tctgtatagt aatttgttat 4560 
tacttttttc aggaatcatt gtcattagtt' ggctggttac cttgaatgta tataaacatt 4€20 
ctcaaaggga tttctaaatc gt-taagggat caactttggg agagagtrca aaattgatcc 4680 
tttttttata acagttcgaa gcggccgcaa ttcttgaaga cgaaagggcc tcgtgataeg 4740 
cctattttta taggttaatg tcatgataat aatggtttot ^agacgtcag gtggcactt-t 4800 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt -caaatatgta 4860 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 4920 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 4980 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct .gaagatcagt tgggtgcacg 5040 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt tt<:gccccga 51O0 
agaacgtttt ccaatgatga gcactttttaa agttctgcta <gtggcgcgg tattatcccg 51-60 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 5220 
tgagtactca ccagtcacag aaaagcat<ct tacggatggc atgacagtaa gagaattatg 5280 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga -caacgatcgg 5340 
aggaccgaag gagctaaccg cttttttgca <:aacatgggg gatcatgtaa ctcgccttga 5400 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 54-60 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta -ctctagcttc 5520 
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ctttttgata 


atctcatgac 


5880 


ctgagcgtca 


gaccccgtag 


aaaagatcaa 


5940 


<-.y u a a t ct g c 


tgcttgcaaa 


caaaaaaacc 


6000 


tea a gag eta 


ccaactcttt 


ttccgaaggt 


6060 


4- -» ^ 4- m4- ^ ^ i- 

tactgtcctt 


ctagtgtagc 


cgtagttagg 


6120 


tacatacctc 


♦ 

getctgetaa 


tcctgttacc 


6180 


tettaceggg 


ttggactcaa 


gacgatagtt 


6240 


ggggggttcg 


tgcacacagc 


ccagcttgga 


6300 


acagegtgag 


ctatgagaaa 


gcgccacgct 


6360 


ggtaagegge 


agggteggaa 


caggagagcg 


6420 


gcatct ttat 


agtcctgtcg 


ggtttcgcca 


6480 


ctegtcaggg 


gggeggagee 


tatggaaaaa 


6540 


ggee Ltttgc 


tggccttttg ctcacatgtt 


6600 


caaccgLaLL 


accgcctttg agtgagctga 


6660 


cagegagtea 


gtgagegagg aageggaaga 


6720 


tct gtgegg c 


atttcacacc 


gcatatggtg 


6780 


aJ" anf"faanr 
otay wuaayv« 


cagtatacac tccgctatcg 


6840 


vdLLcyccdd 


cacccgctga 


cgcgccctga 


6900 


a rr a a a not* r? 
ayacddyCLy 


tgaccgtctc 


egggagctge 


6960 


aaacgegega 


ggcagctgcg gtaaagctca 


7020 


tctgcctgtt 


catccgcgtc cagctcgttg 


7080 


ctgataaagc 


gggccatgtt 


aagggcggtt 


7140 


gtaaggggga 


atttctgttc 


atgggggtaa 


7200 


egataegggt 


tactgatgat 


gaacatgccc 


7260 



Dwcrw»irv ,wn 
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ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa 7320 
aaatcactca gggt-caatgc cagcgcttcg ttaatacaga tgtaggtgtt ccaca^ggta 7360 
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg 7440 
tttccagact ttacgaaaca cggaaaccga agaccatt-ca tgttgttgct caggtcgcag 7500 
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggfcgattca ttctgctaac 7560 . 
cagtaaggca accccgccag -cctagccggg tcctcaacga caggagcacg atcatgcgca 7620 
cccgtggcca ggacccaacg ctgcccgaga t-gcgccgcgt gcggctgctg gagatggcgg 7680 
acgcgatgga tatgttctgc caagggttgg tttqcgcatt cacagttctc -cgcaagaatt 7740 
gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca 7800 
ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga -caaggtatag 7860 
ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgag^cgg ^ataaatogc 7 920 
cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga -gcgatccttg 7980 
aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat 8040 
cccgatgccg ccggaagcga gaagaatcat' aatggggaag gccatccagc ctcgcgtc £098 

<210> 80 
<211> 4450 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: .Recombinant 
pAN4 4 3 plasmid 

<400> 80 

ccatcgaatg gccagatgat <aattcctaa tttttgttga -cactctat<:a ttgatagagt 60 
tattttacca ctccctatca -gtgatagaga aaagtgaaat gaatagtfccg acaaaaatct 120 
agattagaaa ggaggattta aatatgagac aqattactga tatttcacag cfcgaaagaag L80 
ccataaaaca ataccattca gagggcaagt caatcggatt tgttccgacg atggggtttc 240 
tgcatgaggg gcatttaacc ttagcagaca aagcaagaca agaaaacgac gccgttatta 300 
tgagtatttt tgtgaatcct gcacaattcg gccctaatga agattttgaa gcatatccgc 360 
gcgatattga gcgggatgca gctcttgcag aaaacgccgg agtcgatatt ctttttacgc 4 20 
cagatgctca tgatatgtat cccggtgaaa agaatgtcac gattcatgta gaaagacgca 480 
cagacgtgtt atgcgggcgc tcaagagaag gacattttga icggggtcgcg atcgtactga 540 
cgaagctttt caatctagtc aagccgactc gtgcctattt cggtttaaaa -gatgcgcagc 600 



flNRnOTOO: <WO 0121772A2 1 > 



WO 01/21772 



- Ill - 



PCT/US00/25993 



aggtagctgt tgttgatggg ttaatcagcg 
tcgatacggt cagagaggaa gacggcttag 
ctgaggaaag aaaagaagcg cctaagctgt 
tccaagccgg tgaaagagat cctgaagcgg 
cgactagcgg aaccatagac tatgtagagc 
atgaaattgc tggaaagatg attctcgctg 
tagataatat cattattgat attcgtagaa 
atgagcggca aacttcacag ggcaactgtt 
attacaattg atgaagatct cattgatgct 
attgtgaata ataataatgg agcacgtctt 
agcggcgtca tatgcttaaa cggtgcagcc 
attattattt cctacaaaat gatgtctgat 
gctgttctga atgatcaaaa caaaattgaa 
attttgtaaa ggatcccccg gggatccctc 
agcgcttgga gccacccgca gttcgaaaaa 
gcgcacattg tgcgacattt tttttgtctg 
acgcgccctg tagcggcgca ttaagcgcgg 
ctacacttgc cagcgcccta gcgcccgctc 
cgttcgccgg ctttccccgt caagctctaa 
gtgctttacg gcacctcgac cccaaaaaac 
catcgccctg atagacggtt tttcgccctt 
gactcttgtt ccaaactgga acaacactca 
aagggatttt gccgatttcg gcctattggt 
acgcgaattt taacaaaata ttaacgctta 
tgcgcggaac ccctatttgt ttatttttct 
gacaataacc ctgataaatg cttcaataat 
atttccgtgt cgcccttatt cccttttttg 
cagaaacgct ggtgaaagta aaagatgctg 
tcgaactgga tctcaacagc ggtaagatcc 



acttcttcat ggatattgaa ttggttcctg 660 
ccaaaagctc tcgcaatgta tacttaacag 720 
atcgggccct tcaaacaagt gcggaacttg 7aO 
tgataaaagc tgcaaaagat atcattgaaa 840 
tttattccta tccggaactc gagcctgtga 900 
ttgcagttgc tttttcaaaa' gcgcgtttaa 960 
aggaggtgaa ttaatatgta tcgtacgatg 1020 
acggaagcaa acctgaacta tgtgggaagc 1080 
gtgggaatgc ttcctaatga aaaagtacaa 1140 
gaaacgtata ttattcctgg taaacgggga 1200 
gcacgccttg tgcaggaagg agataaggtc 1260 
caagaagcgg caagccatga gccgaaagtg 1320 
caaatgctgg ggaacgaacc agcccgtaca 1380 
gaggtcgacc tgcaggggga ccatggtctc 1440 
taataagctt gacctgtgaa gtgaaaaatg 1500 
ccgtttaccg ctactgcgtc acggatctcc 1560 
cgggtgtggt ggttacgcgc agcgtgaccg 1620 
ctttcgcttt cttcccttcc tttctcgcca 1680 
atcgggggct ccctttaggg ttccgattta 1740 
ttgattaggg tgatggttca cgtagtgggc 1800 
tgacgttgga gtccacgttc tttaatagtg 1860 
accctatctc ggtctattct tttgatttat 1920 
taaaaaatga gctgatttaa caaaaattta 1980 
caatttcagg tggcactttt cggggaaatg 2040 
aaatacattc aaatatgtat ccgctcatga 2100 
attgaaaaag gaagagtatg agtattcaac 2160 
cggcattttg ccttcctgtt tttgctcacc 2220 
aagatcagtt gggtgcacga gtgggttaca 2280 
ttgagagttt tcgccccgaa gaacgttttc 2340 
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caatgatgag cacttttaaa gttctgctat 
ggcaagagca actcggtcgc <cgcatacact 
cagtcacaga aaagcatctt acggatggca 
taaccatgag tgataacact gcggccaact 
agctaaccgc ttttttgcac aacatggggg 
cggagctgaa tgaagccata ccaaacgacg 
caacaacgtt gcgcaaacta ttaactggcg 
tgatagactg gatggaggcg gataaagttg 
ctggctggtt tattgctgat aaatctggag 
cagcactggg gccagatggt aagccctccc 
, aggcaactat ggatgaacga aatagacaga 
attggtagga attaatgatg tctcgtttag 
. agctgcttaa tgaggtcgga atcgaaggtt 
gtgtagagca gcctacattg tattggcatg 
tagccattga gatgttagat aggcaccata 
ggcaagattt tttacgtaat aacgctaaaa 
atggagcaaa agtacattta ggtacacggc 
atcaattagc ctttttatgc caacaaggtt 
gcgcagtggg gcattttact ttaggttgcg 
aagaagaaag ggaaacacct actactgata 
aattatttga tcaccaaggt gcagagccag 
gcggattaga aaaacaactt aaatgtgaaa 
cgtgatggta acttcactag tttaaaagga 
tgaccaaaat cccttaacgt gagttttcgt 
tcaaaggatc ttcttgagat cctttttttc 
aaccaccgct accagcggtg gtttgtttgc 
aggtaactgg cttcagcaga gcgcagatac 
taggccacca ctt<:aagaac tctgtagcac 
taccagtggc tgctgccagt ggcgataagt 



- 112- 

gtggcgcggt attatcccgt attgacgccg £400 
attcfccagaa tgacttggtt gagtactcac 24 60 
tgacagtaag agaattatgc agtgctgcca 2520 • 
tacttctgac aacgatcgga -ggaccgaagg 2580 
at<catgtaac tcgccttgat cgtt^gggaac 2640 
agcgtgacac -cacgatgoct .gtagcaatgg 27O0 
aactacttac tctagcttcc -cggcaacaat £7*60 
■caggaccact tctgcgctcg goccttccgg 2*8-20 
ccggtgagcg tgqctctcgc ggtatcattg 2880 

gtatcgtagt tatctacacg acggggagtc 2940 

i 

fccg.ctgagat aggtgoctca otgattaagc 3000 
ataaaagtaa agtgattaac agcgcattag 3060 
taacaacccg taaactcgcc <cagaagctag 3120 
taaaaaataa gcgggcttcg ctcgacgcct 3180 
ct-cacttttg ccctttagaa ggggaaagct 3240 
gttttagatg tgctttacta agtcatcgcg 3300 
ctacagaaaa acagtatgaa actctcgaaa 3360 
ttfccactaga gaat-gcat-ta tatgcact-ca 34 20 
tattggaaga tcaagagcat caagtcgcta 3480 
gtatgccgcc attattacga caagctabcg 3540 
ccttcttatt -cggccttgaa ttgatcatat 3600 
gtgggtctta aaagcagcat aacctttttc 36€0 
tctaggtgaa gatccttttt gataatctca 37-20 
tccactgagc gtcagacccc -gtagaaaaga 37S0 
tgcgcgtaat ctgctgcttg caaacaaaaa 3840 
cggatcaaga gctaccaact *:tttttt:cga 3900 
caaatactgt -oct.t-ctagtg tagccgtagt 3960 
cgcctacata cctcgctctg ctaatcctgt 4O20 
cgtgtcttac cgggttggac tcaagacgat 4080 
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agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct 4140 

tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca 4200 

cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 4 260 

agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc 4 320 

gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga 4 380 

aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct. tttgctcaca 4 4 40 
tgacccgaca 445 q 

<210> 81 
<211> 10212 
<212> DNA 

<213> Artificial Sequence ( 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN251 plasmid 



<400> 81 
gcggccgcta 


aaaagagctt 


gaggatttgc 


ggagtgaaaa 


tcagacattg 


cggaatcagc 


60 


tagagatgac 


agaagaggat 


tacaaggcac 


tgatcgatat 


catggatcgg 


gccagaaaaa 


120 


tggttgtttc 


gaaggaagac 


ggaagaatga 


aaaaagcggc 


tcaagaaacg 


taaagaaacg 


180 


cctgaaatga 


accggcccta 


tagtaagaat 


aggccggttg 


ttttgatttc 


tatgcagact 


240 


ctcccggtgt 


catttcgcga 


tccatatcag 


gatgccagat 


gagcgggtct 


tcccctttgt 


300 


cccgcgccat 


atcatactta 


acagttttaa 


agttcatttt 


gttccaaaat 


tccgctgatt 


360 


tcattctcgg 


atttgtccgg 


atcggcattt 


tgaatgattt 


tgcaaattca 


accaaggctc 


420 


tcccgtatcc 


cctgttctgg 


tagcccggaa 


gaacctcaag 


cttccacagc 


tccaaataat 


480 


cctggcggtt 


gtcaaaatag 


ggattcgatt 


tgccgttaac 


ttgatacaga 


ctcattcgtg 


540 


ctacaagttt 


atcgccaaaa 


taaatcccgt 


aaaaaggcga 


ggtgctgtca 


ttttcaataa 


600 


tattatcctg 


aagttcttca 


agcattgaaa 


gctcctgaat 


gccgtattct 


ttgaatttct 


660 


tgaattcttc 


cagcgtttta 


tagttgataa 


gcagacgttc 


tacctttgtc aaacaaatct 


720 


ccccctttgt 


tgtttctaca 


tatattgtaa 


acgctttatt 


taaaaaatcc 


aaatatttaa 


780 


actttaattt 


taagcacatg 


ggatctttga 


gaagtaattt 


cttcttactt 


ctgctatgat 


840 


aatacgtaaa 


tgcgtcgacc 


gaaacagcag 


ttataaggca 


tgaagctgtc cggtttttgc 


900 


aaaagtggct 


gtgactgtaa 


aaagaaatcg 


aaaaagaccg 


ttttgtgtga aaacggtctt 


960 
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tttgtttcct tttaaccaac ttjccataact cgaggcctac ctagcttcca agaaagatat 1020 
cctaacagca caagagcgga aa<gatgtttt gttctacatc cagaacaacc tctgctaaaa 1O80 
ttcctgaaaa attttgcaaa aagttgtt^a ctttatctac aaggtgt^gt ataataatct 114iO 
taacaacagc aggacgctct agaggaggag acatcatgaa aattggaatt at^cggcggag 1200 
gctccgttgg tcttttatgc gcctat.tatt tgtcacttta tcacgacgtg actgt^tgtga 1260 
cgaggcggca agaacaggct gcggccattc agt«ctgaagg aatccggctt *ataaaggcg 1320 
gggaggaatt cagggctgat tgcagtgcgg acacgagtat caattcggac tttgacctgc 1380 
ttgtcgtgac agtgaagcag catcagcttc aatctgtttt t<cgt<:gctt gaacgaat-cg 1440 
ggaagacgaa tatattattt ttgcaaaacg -gcatggggca tatccacgac <:taaaagact 1500 
ggcacgttgg ccattccatt tatgttggaa t«cgttgagca cggagctgta agaaaatcgg 1560 
atacagctgt tgatcataca ggcctaggtg <cgataaaatg gagcgcgttc gacgatgctg r620 
aaccagaccg gctgaacatc ttgtttcagc ataaceatix: ggattttccg afcttattatg W80 
agacggattg gtaccgtctg ctgacgggca agctgattgt aaatgcgtgt attaat<cctt 17 40 
taactgcgtt attgcaagtg aaaaatggag aactgctgac aacgccagct tat-ctggctt 1600 
ttatgaagct ggtatttcag gaggcatgcc gcattttaaa acttgaaaat gaagaaaagg I860 
cttgggagcg ggttcaggcc gtttgtgggc aaacgaaaga gaatcgttca tcaatgctgg 1920 
ttgacgtcat tggaggccgg cagacggaag ctgacgccat tatcggatac ttattgaagg 1980 
aagcaagtct tcaaggtctt gatgc<:gtcc acctagagtt tttatatggc agcatcaaag 204O 
cattggagcg aaatacaaac aaagt>ctttt gagctttttc ggtaacatgc tatactcatt 2100 
tcggatcact aactatttat tggagaaagg aagttctaga agatgcagct aact-gaactt 21*60 
tccatcaaaa atcagaatgt gtttgtacag <:actatatag atggcaaaga agaaatgtct 2220 
tctttttttg attacagtat tcatcataag gacatgtggc gcgaaagact ggaagactta 2280 
tcttcccggt ttttcgcaag agaggaattg gcggcgtact taacctctta -ccataataaa 2340 
ttcggttcaa gtgcgatgca gtctgctatt gagaagctga aggacccgtc aagtgccgct 24O0 
gtagtcggcg gacagcaggc aggactttta acaggaccgc tttacaccat acataaaatc 24-60 
atttcaatca ttgt.tttagc aaagcaacaa gaaaaggaac tgcaagtgcc tgtcatacca 2520 
atcttctggg tggctggaga agaccacgat ttggatgaga ttaattttgt *cacacat<:t 2580 
gaagagaatg ggcctgtgaa aaaaaagctg cctcagtctt attggaagaa atcatcagca 2640 
gcgagtacat cgcttgatca ggaaaagtgt gccgcgtgga tagatgatgt tt*tgccgct 2700 
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tttgaagaaa cagaccatac gaatacactt 
tctgttacgt ttactgactt ctttgaactg 
ttagttttat taaattctgg ggatcctgtt 
'tacagattaa atcagaacgc agaagcggtc 
gcgcggtggt cccacctgac cccatgccga 
gtagtgtggg gtctccccat gcgagagtag 
gctcagtcga aagactgggc ctttcgtttt 
agtaggacaa atccgccggg agcggatttg 
cgggcaggac gcccgccata aactgccagg 
gatggccttt ttgcgtttct acaaactctt 
ttaaaaaagc gaaatcgaag aagctgtctg 
ctgttgtgtc atctttaaaa gtcacagccg 
tgaacaccca gaaagataaa ggaaatgcag 
ggattgctgt tcatgcctca aaagaggtta 
aaaaggtgct tcctaatgag aaacggcagc 
aaaaagcaca gaaagctatt aaagcaactg 
atgccccaaa agcttgggca cttggatatg 
ataccggggt ggaatggaat catccggcat 
aaaatcctaa tgagcctgaa aatgaaatga 
gcccttatga tgatttggct catggaaccc 
ctgatggaac aaatcaaatc ggtgtagcac 
tctctgaaga tggcggcact gatgctgaca 
caaaggacgc ggaaggaaat ccccacccgg 
ggggaggggg ctctggactt gatgaatggt 
ccgatatttt ccctgagttt tcagcgggga 
gttctatcgc aaatccggca aactatccag 
ccaattcccc atggagagaa aagaaaatcg 
ttcttgaatt taaaaaggct gaaagagtaa 
aaaatcgtga aacaggcgaa agaaagttgt 



PCT/US00/25993 

- 115 - 

ctcgacaatg tgaaacgatq tttaagggaa 2760 
ctgatcgcgg attitgttcca agaagagggc 2820 
ttggcggatg agagaagatt ttcagcctga 2880 
tgataaaaca gaatttgcct ggcggcagta 2940 
actcagaagt gaaacgccgt agcgccgatg 3000 
ggaactgcca ggcatcaaat aaaacgaaag 3060 
atctgttgtt tgtcggtgaa cgctctcctg 3120 
aacgttgcga agcaacggcc cggagggtgg 3180 
catcaaatta agcagaaggc catcctgacg 3240 
ggtacccaga aaaagcggca aaagcggctg 3300 
ccgctaagac ggaatatcaa aagcgttctg 3360 
atgaatccca gcaagatgtc ctaaaatact 3420 
accaaattca ttcttattat gtggtgaacg 3480 
tggaaaaagt ggtgcagttt cccgaagtgg 3540 
tttttaagtc atcctcccca tttaatatga 3600 
acggtgtgga atggaatgta gaccaaatcg 3660 
atggaactgg cacggttgtt gcgtccattg 3720 
taaaagagaa atatcgcgga. tataatccgg 3780 
actggtatga tgccgtagca ggcgaggcaa 3840 
acgtgacagg cacgatggtg ggctctgaac 3900 
ctggcgcaaa atggattgct gttaaagcgt 3960 
ttttggaagc tggtgaatgg gttttagcac 4020 
aaatggctcc tgatgttgtc aataactcat 4 080 
acagagacat ggtcaatgcc tggcgttcgg 4140 
atacggatct ctttattccc ggcgggcctg 4200 
aatcgtttgc aactggagcg actgagaatt 4260 
ctaatgttga ttactttgaa cttctgcata 4320 
aagattgtgc tgaaatatta gagtataaac 4380 
atcgagtgtg gttttgtaaa tccaggcttt 4440 



WO 01/21772 PCT/US00/25993 

- H6- 

gtccaatgtg <:aactggagg agagcaatga aacatggcat tcagtcacaa aaggttgttg 4500 
ctgaagttat taaacaaaag ccaacagttc gttggttgtt tctcacatta acagttaaaa 4560 
atgtttatga tggcgaagaa ttaaataaga gtttgtcaga tat^gct<:aa ggatttcgcc 4*20 
gaatgatgca atataaaaaa attaataaaa atcttgttgg ttttatgcgt ^caacggaag 4680 
tgacaataaa taataaagat aattcttata atcagcacat gcatgtattg gtatgtgtgg 4740 
aaccaactta ttttaagaat acagaaaact acgtgaatca aaaacaatgg attcaatttt 4800 
ggaaaaaggc aatgaaatta gactatgatc caaat-gtaaa agttcaaatg attcgaccga 4860 
aaaataaata taaatcggat atacaatcgg caattgacga aactgcaaaa tatcctgtaa 4 920 
aggatacgga ttttatgacc gatgatgaag aaaagaatt* gaaacgtttg t-ctgatttgg 4980 
aggaaggttt acaccgtaaa aggttaatct cctatggtgg tttgttaa'aa gaaatacata 5040 
aaaaattaaa ccttgatgac acagaagaag gcgatttgat tcatacagat gatgacgaaa 5100 
aagccgatga agatggattt tctattattg caatgtggaa ttgggaacgg aaaaattatt 51-60 
ttattaaaga gtagttcaac aaacgggcca tat-tgt-tgta taagtgatga aatactgaat 52-20 
ttaaaactta gtttatatgt ggt'aaaatgt tttaatcaag t-ttaggagga attaattatg 5280 
aagtgtaatg aatgtaacag ggttcaatta aaagagggaa gcgtatcatt eaccctataa 5340 
actacgtctg ccct-cattat tggagggtga aatgtgaata catcctattc acaatcgaat 5400 
ttacgacaca accaaatttt aatttggctt tgcattttat ctttttttag cgtattaaat 5460 
gaaatggttt tgaacgtctc attacctgat attgcaaatg attttaataa accacctgcg 5S20 
agtacaaact gggtgaacac agcctttatg ttaacotttt ccattggaac agctgtatat 5580 
ggaaagctat ctgatcaatt aggcatcaaa aggttactcc tatt<tggaat tataataaat 564-0 
tgtttcgggt cggtaattgg gtttgttggc cattctttct ttt<:cttact tattatggct "5700 
cgttttattc aaggggctgg tgcagctgca tttocagcac -tcgtaatggt tgtagttgcg 5760 
cgctatattc caaaggaaaa taggggtaaa gcatttggtc ttattggatc .gatagtagcc 5820 
atgggagaag gagtcggtcc agcgattggt ggaatgatag eccattatat t-cattggtcc 5880 
tatcttctac tcattcctat gataacaatt atcactgttc -cgtttcttat gaaattatta 5940 
aagaaagaag taaggataaa aggtcatttt gatatcaaag gaattatact aatgtctgta 6000 
ggcattgtat tttttatgtt gtttacaaca tcatatagca tttcttttct tat-cgttagc .6060 
gtgctgtcat tcctgatatt tgtaaaacat atcaggaaag taacagat<:c ttttgttgat 6120 
cccggattag ggaaaaatat accttttatg attggagttc tttxgtggggg aattatattt 6180 



RNROOOO. <WO 012I772A2 I > 



WO 01/21772 PCT/US00/25993 

- 117 - 

ggaacagtag cagggtttgt ctctatggtt ccttatatga tgaaagatgt tcaccagcta 6240 
agtactgccg aaatcggaag tgtaattatt ttccctggaa caatgagtgt cattattttc 6300 
ggctacattg gtgggatact tgttgataga agaggtcctt tatacgtgtt aaacatcgga 6360 
. gttacatttc tttctgttag ctttttaact gcttcctttc ttttagaaac aacatcatgg 6420 
ttcatgacaa ttataatcgt atttgtttta ggtgggcttt cgttcaccaa aacagttata 64 80 
tcaacaattg tttcaagtag cttgaaacag caggaagctg gtgctggaat gagtttgctt 6540 
aactttacca gctttttatc agagggaaca ggtattgcaa ttgtaggtgg tttattatec 6600 
atacccttac ttgatcaaag gttgttacct atggaagttg atcagtcaac ttatctgtat 6660 
agtaatttgt tattactttt ttcaggaatc attgtcatta gttggctggt taccttgaat 6720 
gtatataaac attctcaaag ggatttctaa atcgttaagg gatcaacttt gggagagagt 6780 
tcaaaattga tccttttttt ataacagttc gaagcggccg caattcttga agacgaaagg 6840 
gcctcgtgat acgcctattt ttataggtta atgtcatgat aataa'tggtt tcttagacgt 6900 
caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 6960 
attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 7020 
aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 7080 
tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 7140 
agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 7200 
gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 7260 
cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc 7320 
agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 7380 
taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 74 40 
tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 7500 
taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 7560 
acaccacgat gcctgcagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 7620 
ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 7680 
cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 7740 
agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 7800 
tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 7860 
agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 7920 
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tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg 7980 
ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ocactga^cg tcagaccccg 8040 
tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 8100 
aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ^gafccaagag ctaccaactc 8160 
tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaataotgtc cttctagtgt 8220 
agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 8280 
taatcctgtt accagtggct gctgccagtg gcgataagtc gtgt<:ttacc gggttggact 8340 
caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 8400 
agcccagctt ggagcgaacg acctacaccg aactga^ata cctacagcgt gagctatgag 8460 
aaagcgccac gcttcccgaa gggagaaagg <:ggacaggta *ccggtaagc ggcagggtcg 8520 
gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatxrtt tatagtcctg 8560 
tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 8640 
gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggcct^tt tgctggcctt 8700 
ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 8760. 
ttgagtgagc cgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 9820 
aggaagcgga agagcgcctg atgcggtatt ttctccttac -gcatctgtgc ggtatttcac 8880 
accgcatatg gtgcactctc agtacaatct gctctxgatgc cgcatagtta agccagtata 8940 
cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc caacacccgc 9000 
tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 9060 
ctccgggagc tgcatgtgtc agaggttttc accgtcatca ecgaaacgcg cgaggcagct 9L20 
gcggtaaagc tcatcagcgt ggtcgtgaag cgattcacag atgtctgcct gttcatccgc . 9180 
gtccagctcg ttgagtttct ccagaagcgt taatgtctgg cttctgataa agcgggccat 9240 
gttaagggcg gttttttcct gtttggtcac ttgatgcctc cgtgtaaggg ggaatttctg 9300 
ttcatggggg taatgatacc gatgaaacga gagaggatgc tcacgatacg ggttactgat 9360 
gatgaacatg cccggttact ggaacgttgt gagggtaaac aactggcggt atggatgcgg 9420 
cgggaccaga gaaaaatcac tcagggtcaa tgccagcgct tcgttaatac agatgtaggt 9480 
gttccacagg gtagccagca gcat<:ctgcg atgcagatcc ggaacataat ggtgcagggc 9540 
gctgacttcc gcgtttccag actttacgaa acacggaaac cgaagaecat tcatgttgtt 9600 
gctcaggtcg cagacgtttt gcagcagcag tcgcttcacg ttcgctcgcg tatcggtgat 9660 
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tcattctgct 


aaccagtaag gcaaccccgc cagcctagcc 


gggtcctcaa 


cgacaggagc 


9720 


acgatcatgc 


gcacccgtgg ccaggaccca acgctgcccg 


agatgegecg 


cgtgcggctg 


9780 


ctggagatgg 


cggacgcgat ggatatgttc tgccaagggt 


tggtttgcgc 


attcacagtt 


9840 


ctccgcaaga 


attgattggc tccaattctt ggagtggtga 


ateegttage gaggtgeege 


9900 


cggcttccat 


tcaggtcgag gtggcccggc tccatgcacc 


gcgacgcaac 


geggggagge 




dtjdCdaggLd 


tagggcggcg cctacaatcc atgccaaccc 


gttccatgtg ctcgccgagg 


10020 


cggcataaat 


cgccgtgacg atcagcggtc cagtgatcga 


agttaggctg 


gtaagagccg 


10080 


cgagcgatcc 


ttgaagctgt ccctgatggt cgtcatctac 


ctgcctggac 


ageatggect 


10140 


gcaacgcggg 


catcccgatg ccgccggaag cgagaagaat 


cataatgggg 


aaggccatcc 


10200 


agcctcgcgt 


eg 






10212 



<210> 82 

<211> 10426 ' 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN267 plasmid 



<400> 82 
aacaaaattc 


tccagtcttc 


acatcggttt 


gaaaggagga 


ageggaagaa tgaagtaaga 


60 


gggatttctg 


actccgaagt 


aagtcttcaa 


aaaatcaaat 


aaggagtgtc aagaatgttt 


120 


gcaaaacgat 


tcaaaacctc 


tttactgccg 


ttattcgctg 


gatttttatt gctgtttcat 


180 


ttggttctgg 


caggaccggc 


ggctgcgagt 


getgaaaegg 


cgaacaaatc gaatgagctt 


240 


acagcaccgt 


cgatcaaaag 


eggaaccatt 


ettcatgeat 


ggaattggtc gttcaatacg 


300 


ttaaaacaca 


atatgaagga 


tattcatgat 


gcaggatata 


cagccattca gacatctccg 


360 


attaaccaag 


taaaggaagg 


gaatcaagga 


gataaaagca 


tgtcgaactg gtactggctg 


420 


tatcagcega 


categtatea 


aattggcaac 


cgttacttag 


gtactgaaca agaatttaaa 


480 


gaaatgtgtg 


cagccgctga 


agaatatggc 


ataaaggtca 


ttgttgacgc ggtcatcaat 


540 


cataccacca 


gtgattatgc 


cgcgatttcc 


aatgaggtta 


agagtattcc aaactggaca 


600 


catggaaaca 


cacaaattaa 


aaactggtct 


gatcgaaata 


gtacataatg gatttcctta 


660 


cgegaaatae 


gggcagacat 


ggcctgcccg 


gttattatta 


tttttgacac cagaccaact 


720 


ggtaatggta 


gcgaccggcg 


ctcaggatcg 


tctcggtacc 


aagagtttgt agaaaegcaa 


780 


aaaggecate 


cgtcaggatg 


gccttctgct 


taatttgatg 


cctggcagtt tatggeggge 


840 
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gtcctgcccg -ccaccctccg ggccgttgct tcgcaacgt* caaatccgct cccggcggat 900 
ttgtcctact caggagagcg tt<caccgaca aacaacagat aaaacgaaag -gcccagtctt 960 
tcgactgagc ctttcgtttt atttgatgcc tggcagttcc ctactctcgc -atggggagac 1020 
cccacactac catcggcgct acggcgtttc acttctgagt tcggcatggg gtcaggtggg 1080 
accaccgcgc tactgccgcc aggcaaattc tgttttatca gaccgcttct gcgttctgat 1140 
ttaatctgta tcaggctgaa aatctt-ctct cat-ccgccaa aacaggatcc at<:acgaagc 1200 
gtcgtatcga aaaaattaat tttgcgcaac ggagaccacc gcttccttct tcttgccttg 1260 
tttcacaaac ggcatcattt cacgaagctc <cttcccact acttcgattt gatgttcgtt 1320 
ctcgcttgca ttgatagcgt tgaaacgagg acggtttact «tggttttcga cgatccactc 1380 ( 
ttttgcgaat gtaccgtttt ggatatcttt taatacttct ttcatagat* cttttacttt 14 40 
agcgtccaca acgcgagggc ctgatacgaa atctccccac tgfcgctgtgt cagagattga 1500 
atatctcatt cctgcaagtc cttcttcgta cataaggtct acgattaatt tcagctcatg 1560 
aagacactcg aagtatgcaa gttcaggctg ataacctgct -tcagttaagg tttcaaatcc 1620 ... 
ggctttgaca agcgcgctta atccgccgca aagaactgct tgctcaccga acaaatctgt 1680 
ttctgtttct tctttaaatg tcgtttctaa tacgcccgct -cttgcgccgc .cgattccttt 1740 
agcataagcg agggctttgt ctcttgcttx: tccagtcaca «tcttgataga «gcgaacaa 1800 
tgcaggtacg ccagctcctt gctcatatgt t<:ttcttacc aagtgtcecg ggcctttagg 18*60 
ggccactaag aatacatcta catccgccgg aggaacaatt tgatggaaat gcacgttaaa 1920 
tccatgagcg aatactaatg attttcctgc tgtcaattca tctttgattt -cagcttcgta 1980 
tactttttgc tgctgctcat ccggaagcag aaccatgatg atttcggctt gggctgccgc 2040 ; 
ttcttttact gaaaatactt tatgtccgtc ttcttgggct tgagtgaaag atfrttccttg 2100 
tctaacaccg acgatcacgt ctactccgct ttctt*aagg -ttcagggcaU gtgcgtggcc 2160 
ttgcgaaccg tacccgataa ccgctactgt tttt<:cagcc aatacgttct -ctttgatatc 2220 
accgttataa tatactttta ccatttcaat cbctcccttg -ttatgtttta tacaatagat 2280 
attgttttat tggatgacgc cttttgctgg ttcccct-cgc aaaagccgtt gtacctgttc 2340 
tcgcgatttc tttaatgcca taaggtttta ataactcaat aagcgcttoa attttgttag 2400 
attcacctgt cacctgaaca acgatgctgt «ctctgctgac atcaacgaca gaggctctaa 2460 
acggttctat gattccatta atctctgttc ttgttgaagg tgcggagaca accttgatta 2520 
. aggccagctc cctctggaca atcgattgat ttgtgatgtc fcgtgactttc ageacatcaa 2580 
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tctgtttgtt 


gagctgtttc 


gttaactgtt 


caacatcatt 


ttcaccttca acatgaacga 


2640 


cgaaggtgat 


tctggaaacg 


ccggctgttt 


ctgtgtgtcc 


aactgtaatg ctttcaatgt 


2700 


tgtaatgcct 


ttttgtgaat 


agaccggtga 


tccggtttaa 


caccccggag cggttcacca 


2760 


cagtcaatgt 


gataattctt 


ttcaaggttt 


cacccccacc 


atttcatgca gccctttccc 


2820 


cggagccacc 


atcgggaata 


ctttttcttc 


gctggcaacc 


cgcacgtcaa 'tgacaacagg 


2880 


ttctcttgat 


gttaatgcct 


cttccagctt 


ttcctttgct 


tccgcttctg atgaaattct 


2940 


gatgccttta 


atgccgtatg 


cttcggacaa 


tttgacgaag 


tcaggctgag aagcgaattt 


3000 


agattctgaa 


taacgttctt 


catagaaaat 


ttcctgccac 


tgtctgacca ttccgagaca 


3060 


agcgttattt 


aaaatcacta 


ccttgaccgg 


aagatttaat 


tcgcgaataa catcgagttc 


3120 


ttgaagcgtc 


atttggaatc 


cgccgtctcc 


gacaaccgcg 


acaacagtag catctttttc 


3180 


ggccagctgt 


gcgccgatcg 


ccgccggaag 


accgaatccc 


atcgttccaa gtccgcctga 


3240 


cgtgacccat 


ttatctgctt 


tttggaacgg 


ataaaattgc 


gctgaccaca tttgatgctg 


3300 


gcctacatcc 


gttgcgacaa 


tggcctctcc 


ttttgtaaat 


tgatgaatat attcaatcaa 


3360 


tttctgaggt 


ttaaaacctt 


cttcttcatt 


atctacatac 


cagagcggat actcttcttt 


34 20 


ccattctgcg 


agctgttttt 


tccattcgct 


tgaatcgctt 


tgtttgccgt cttgtttgat 


3480 


cagctcctgc 


aggacaattt 


tgctgtctcc 


gactacagga 


atctgtgttt tcatgatttt 


3540 


tccgatttca 


gctggatcaa 


tatcgatgtg 


ggctatcttt 


gcgtttctgg caaagtgttt 


3600 


caggtttcct 


gtgacacggt 


catcaaaacg 


ggcgccgata 


ctgattaata gatcacattc 


3660 


atgaagggcc 


atattggctg 


tataagtacc 


gtgcattccc 


gccatcccta ggaaaagcgg 


3720 


atggtcagcc 


gggaagcctc 


cgagccccaa 


aagggtgtgt 


gccacaggga tttgctgctg 


3780 


ttcagcataa 


ttttttaatt 


cttctgacgc 


ttttccgtgc 


agtacgcccg cacccgccag 


3840 


gatcaccggt 


tttttcgcac 


tgctcacggc 


ttccacaagc 


ttgcggatct gcaaataatt 


3900 


cggctctgtt 


gtcggctggt 


atcccgggag 


attcatctca 


tgatcgtagc tgaattctcc 


3960 


ttcaattgtt 


gctacatctt 


tcggaatatc 


aatcaataca 


ggtccgggtc ttccagttgt 


4020 


tgcaatatgg 


aacgcttctt 


taatgatgcg 


cggcagatct 


tccggctggc gaacctggta 


4080 


gctgtgtttt 


gttactggca 


tcgtaatccc 


taaaatgtct 


gcttcctgaa atgcatcgct 


4140 


cccgattaca 


gaggttgcta 


cctgccctgt 


aaagacgact 


aacggcaatg aatcaatcat 


4200 


ggcatcagca 


aggcctgtaa 


caaggtttgt 


cgctcccggc 


ctgacgtggc aatgacgaca 


4260 


ccggtttccg 


gagacccttg 


cgtatccctc 


cgctgcatga 


attgctcctt gttcgtgacg 


4 320 
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ggaaggatat gtaccaacct gaatgtatag cttatcgtaa atcggaagca cagccccgcc 4 380 
cggataaccg aagatcattt •ctact-tbctc tttttt-taat ^gattcaatca gcattaatcg 4 440 
'tccgctcatc gtctgtgtac attcggcaga tgctgaatcc acctgtacat tagtccccat 4500 
tttatctcct cctctagagc gtcctgctgt tgttaagatt attataccac accttgtaga 4 560' 
taaagtcaac aactttttgc aaaatttttc aggaatttta gcagaggttg ttctggatgt 4620 
agaacaaaac atctttccgc tcttgtgctg *ttaggatat<: £ttx:t£ggaa gctaggtagg 4630 
cctcgagtta tggcagttgg ttaaaaggaa acaaaaagac cgtttt<caca caaaacggtc 4740 
tttttcgatt tctttttaca gtcacagcca cttttgcaaa aaccg^a<^ag ctt<:atgcct 4800 
tataactgct gtttcggtcg acgaagcggc cgccgtttaa acgaattcct gcagctggcg 4860 
aatggcgatt ttcgttcgtg aatacatgtt ataataacta taactaataa cgtaacgtga 4920 
ctggcaagag atatttttaa aacaatgaat aggtttacac ttactttagt tttatggaaa 4 980 
tgaaagatca tatcatatat aat-ctagaat aaaabtaact aaaataatta ttatctagat *S040 
.aaaaaattta gaagccaatg aaatctataa ataaactaaa ttaagtttat <taatraaca 5100 
actatggata taaaataggt actaatcaaa atagtgagga ggatatattt gaatacatac 5160 
gaacaaatta ataaagtgaa aaaaatactt cggaaacatt taaaaaataa ccttattggt 5220 
acttacatgt ttggatcagg agttgagagt ggactaaaac «caaatagtga t-cttgacttt 5-280 
ttagtcgtcg tatctgaacc attgacagat caaagtaaag aaatacttat acaaaaaatt 5340 
agacctattt caaaaaaaat aggagataaa agcaacttac -gatatattga attaacaatt 54O0 
attattcagc aagaaatggt accgtggaat catcctceca aacaagaatt tatttatgga 5460 
gaatggttac aagagcttta tgaacaagga tacatt-cctc agaaggaatt aaatrtcagat 5520 
ttaaccataa tgctttacca agcaaaacga aaaaataaaa -gaatatacgg aaat«:a£gac 5580 
ttagaggaat tactacctga tattccattt tct^afcgtga -gaagagccat tatggattcg 5640 
tcagaggaat taatagataa ttatcaggat gatgaaacca aotctatatt aactt^tat^c 5700 
cgtatgattt taactatgga .cacgggtaaa at-cataccaa aagatattgc gggaaatgca 5760 
gtggctgaat cttctccatt agaacatagg gagagaattt tgttagcagt tcgtagttat 5820 
cttggagaga atattgaatg gactaatgaa aatgtaaafct taactataaa ctatttaaat 5880 
aacagattaa aaaaattata aaaaaattga aaaaatggfcg gaaacacttt tttcaa.tttt 5940 
tttgttttat tatttaatat ttgggaaata ttcattctaa ttggtaatca gattttagaa 6000 
aacaataaac ccttgcatag ggggatcgat atccgtttag gotgggcggt gatagcttct 6060 
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cgttcaggca gtacgcctct tttcttttcc 
ttcccgggga aaatgccaaat aggcgatcgc 
actgcggtca atagatttca caatgtgatg 
aatggaaaca accagatatt tatgaatcag 
gcaggttcat cctctgtctc tatcaatacg 
aataaagctg gagcgggttc atttcaagtg 
gccaggtctg tagctgtgct ttatcctgat 
gagaattaca aaacaggtgt aacacattct 
gcagatgcga atacaacaaa agccgtttat 
ttaaggatgg agatcaattc acaatcggaa 
atcatgttaa aaggaacgaa cagtgatggt 
aaaagagatc cagcgtcqgc caaaaccatc 
gtaaatgctt atatctataa acatgatggg 
cctggaaaac caatgactaa aaatgcagac 
acggatacaa ccaacgcaaa agtgattttt 
aatcagcctg gctttgatta cgtgctaaat 
tctcttcccc attgagggca aggctagacg 
tttctttttt gttcataaat cagacaaaac 
cacaatataa atgtgaaata cttcacaaac 
aaggatgctg atattgtctg catttgcgcc 
cttgaattag tagaaaagaa cttgaagatt 
agcggatttg acggcatttt cttagtcggg 
agaattggag ccaatcaatt cttgcggaga 
aacatatcca tcgcgtccgc catctccagc 
gggtcctggc cacgggtgcg catgatcgtg 
cggggttgcc ttactggtta gcagaatgaa 
tgctgctgca aaacgtctgc gacctgagca 
tcgtaaagtc tggaaacgcg gaagtcagcg 
gcaggatgct gctggctacc ctgtggaaca 



agacctgagg 


gaggcggaaa tggtgtgagg 


6120 


gggagtgctt 


tatttgaaga tcaggctatc 


6180 


gctggacagc 


ctgaggaact ctcgaacccg 


6240 


cgcggctcac 


atggcgttgt gctggcaaat 


6300 


gcaacaaaat 


tgcctgatgg caggtatgac 


6360 


aacgatggta 


aactgacagg cacgatcaat 


6420 


gatattgcaa 


aagcgcctca tgttttcctt 


6480 


ttcaatgatc 


aactgacgat taccttgcgt 


6540 


caaatcaata 


atggaccaga cgacaggcgt 


6600 


aaggagatcc 


aatttggcaa aacatacacc 


6660 


gtaacgagga 


ccgagaaata cagttttgtt 


6720 


ggctatcaaa 


atccgaatca ttggagccag 


6780 


agccgagtaa 


ttgaattgac cggatcttgg 


6840 


ggaatttaca 


cgctgacgct gcctgcggac 


6900 


aataatggca 


gcgcccaagt gcccggtcag 


6960 


ggtttatata 


atgactcggg cttaagcggt 


7020 


ggacttaccg 


aaagaaacca tcaatgatgg 


7080 


ttttctcttg 


caaaagtttg tgaagtgttg 


7140 


aaaaagacat 


caaagagaaa cataccctgc 


7200 


ggagcaaacc 


aaaaacctgg tgagacacgc 


7260 


ttcaaaggca 


tcgttagtga agtcatggcg 


7 320 


cggcacctcg 


ctaacggatt caccactcca 


7380 


actgtgaatg 


cgcaaaccaa cccttggcag 


7440 


agccgcacgc 


ggcgcatctc gqqcaqcqtt 


7500 


ctcctgtcgt 


tgaggacccg gctaggctgg 


7560 


tcaccgatac 


gcgagcgaac gtgaagcgac 


7620 


acaacatgaa 


tggtcttcgg tttccgtgtt 


7680 


ccctgcacca 


ttatgttccg gatctgcatc 


7740 


cctacatctg 


tattaacgaa gcgctggcat 


7800 
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tgaccctgag tgatttttct ctggtcccgc =cgcatccata ccgccagttg tttaccct-ca 78-60 
caacgttcca gtaaccgggc atgttcatca tcagtaaccc gtatcgtgag -catcctct-ct 7 920 
cgtttcatog gtatcattac ccccatgaac agaaafccccc cttacacgga ggcat>cagtg 7980 
accaaacagg aaaaaaccgc -ccttaacatg gcccgcttta tcagaagcca gacattaacg 8040 
cttctggaga aactcaacga gctggacgcg gatgaacagg cagacaOctg tgaatcgctt 81O0 
cacgaccacg ctgatgagct ttaccgcagc tgcctcgegc gtttcggtga tgac^gtgaa 8160 
aacctctgac acatgcagct -cccggagacg gtcacagctt gt-ctgtaagc ggatgccggg 8220 
agcagacaag cccgtcaggg cgcgfrcagcg -ggt^ttggcg ggtgtcgggg cgcagccatg 8280 
acccagtcac gtagcgatag cggagt^tat actggct-taa ctatgcggca fecagagcaga 8340 

ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta aggagaaaat 8400 

i 

.accgcatcag gcgctcttcc gctt<:ctcgc tcactgactc gctgcgctcg gt-cgttcggc 8460 
tgcggcgagc ggtatcagct cactxraaagg cggtaatacg gttatccaca gaa&cagggg 8520 
ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg -8580 
ccgcgttgct ggcgtttttc cataggctcc gcccccctga -cgagcatcac aaaaatcgac 8640 
gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg t-ttccccctg 8700 
gaagctccct cgtgcgctct xctgttccga ccctgocgct taccggatac ctgtecgcct 8*750 
ttctcccttc gggaagcgtg gcgctttctc atagctcaog ctgtaggtat -ctcagttcgg 8820 
tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc -ccccgttcag <rccgaccgct 8880 
gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 8940 
tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt ^gctacagagt 9000 
tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc 9060 
tgctgaagcc agttacct^c ggaaaaagag-ttggtagctc ttgabccggc aaacaaacca 9120 
ccgctggtag cggtggtttt tttxgtttgca agca^cagat tacgcgcaga aaaaaaggat 9180 
ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac <gaaaactcac 9240 
gttaagggat tttggtcatg agattatcaa aaaggatctt eacctagatc cttttaaatt "9300 
aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 9360 
aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagtt^g 9420 
cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 9480 
ctgcaatgat accgcgagac ccacgctcac -cggct-ccaga tttatcagca ataaaccagc 9540 
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cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 9600 

ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 9660 

ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 9720 

ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 9780 

gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 9840" 

ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 9900 

ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 9960 

gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 10020 

ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 1O0S0 

cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 10140 

ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 10200 

aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 10260 

gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 10320 

gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattato atgacattaa 10380 

cctataaaaa taggcgtatc acgaggccct ttcgtcttca agaatt 10426 

<210> 83 
<211> 4191 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN263 plasmid 

<400> 83 

ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag cccggatccg 60 

aaagaagagg atatcccggt tttacagaag gcattggatg atccaaaggt gtccatcaga 120 

agacaggctg ttgtgtactt aggaatgatt gaaacacctg atgttcttcc tctattgtat 180 

aaagcacttg aggacaaagc tgtatcagtc agaagaacgg ccggagactg cctgtctgat 240 

atcggcgatc ctcaagccat tcctgctatg atcaagtcat taagcgactc cagcaagctt 300 

gttcgctggc gtgccgccat gttcctgtac gaagtcggcg atgaaagtgc aattgaagct 360 

ttgcgcgctg ccgaagatga ccccgaattt gaggtcagcc ttcaagtcaa aatggcgctt 420 

gaacgtattg agcatggaga agaagcaaaa ggttctgttt ggaaacaaat gacggaaagc 4 80 
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agaaaaaaag gcgaataaag ataaaaaagg 
ggtcgaccga aacagcagtt ataaggcatg 
gactgtaaaa agaaatcgaa aaagaccgtt 
taaccaactg -ccataactcg aggcctacct 
agagcggaaa gatgttttgt tctacatcca 
tttgcaaaaa gttgttgact ttatctacaa 
gacgctctag aggaggagac accatggcag 
tcgatagagc tccgcaccgc agtttgcttc 
gcaagccgtt tattgcggtg tgtaattcat 
tgcaggagtt tgggaaaatc gtaaaagaag 
aatttaatac cattggggta gatgatggca 
cgctgccaag ccgtgaaatt atcgcagact 
ttgacggaat ggtctgtatt ccgaactgcg 
caatgcgcat <:aacattccg acgatttttg 
caagttacgg gcgaaaaatc tccctttcct 
cagggaaaat caacgaaaac gagcttcaag 
ggtcttgctc aggcatgttt acggcgaact 
ttgctttgcc gggtaatgga accattctgg 
gaaaatcggc tgcgcaatta atggaaacga 
ttacagtaaa agcgattgat aacgcgtt^tg 
ataccgttct tcataccctt gcccttgcaa 
gcattaacga agtcgctgag cgcgtgccgc 
tgtttattga agatcttcac gaagcgggcg 
agaaagaagg agcgcttcat ttagatgcgc 
ccattgccgg acatgaagta aaggattatg 
ctgaaaaggg aggccttgct gttttattcg 
aaacaggcgg cgtacagaat gggattacaa 
ctcaggacga ggcgcttgac ggcattatca 
tcatcagata cgaagggcca aaaggcggac 
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tgcagatcat gcaccttttt tatgt'gaatt 54 0 
aagctgfcccg gtttttgcaa aagtggct<gt 600 
ttgtgtgaaa acggtctttt tgtttccttt 660 
agcttccaag aaagatatcc taacagcaca 720 
gaacaacctc tgctaaaatt cctgaaaaat 7€0 . 
ggtgtggtat aataat<:tta acaacagcag 340 
aattacgcag taatatgatc acacaaggaa 900 
gtgcagcagg ggtaaaagaa gaggattt<:g 960 
acattgatat <:gt*:<:ccggt catgttcact 10 20 
-caatcagaga agcagggggc ^gttccgtttg 1O80 
tcgcaatggg gcatat»cggt atgagatat* 1140 
ctgtggaaac ^fgttgtatcc gcacactggt 1200 
acaaaatcac accgg^aatg -cttatggegg 12-60 
tcagcg<jcgg accgatggcg -gcaggaagaa 1320 
cagtattcga aggggtaggc gcctaccaag 1380 
aactagagca gtt<cggatgc ^ccaaegtgcg 14 40 
caatgaactg £ctgtcagaa gcacttggtc 1500 
caacabctcc ggaacgcaaa gagtt*tgtga 15-60 
ttcgcaaaga tatcaaaccg -cgtgatattg 1620 
cactcgatat ggcgctcgga ggt^tacaa 1*680 
acgaagccgg cgttgaatac tctttagaac 17 40 
acttggctaa gctggcgcct -gcatcggatg 1800 
gcgtttcagc ggetctgaat .gagctttcga 1860 
tgactgttac aggaaaaact cttggagaaa 1920 
acgtcattca cccgctggat caaccatt<:a 1980 
gtaatctagc tccggacggc ^ctatcatta 2040 
gacacgaagg gccggctgtc gtattcgatt 2100 
accgaaaagt aaaagaaggc -gacgttgtca 2160 
ctggcatgcc -ggaaaJtgctg gcgccaacat 2220 
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cccaaatcgt tggaatggga ctcgggccaa aagtggcatt gattacggac ggacgttttt 2280 
ccggagcctc ccgtggcctc tcaatcggcc acgtatcacc tgaggccgct gagggcgggc 2340 
cgcttgcctt tgttgaaaac ggagaccata ttatcgttga tattgaaaaa cgcatcttgg 2400 
atgtacaagt gccagaagaa gagtgggaaa aacgaaaagc gaactggaaa ggttttgaac 24 60 
cgaaagtgaa aaccggctac ctggcacgtt attctaaact tgtgacaagt gccaacaccg 2520 
gcggtattat gaaaatctag acccctggcg taatagcgaa gaggcccgca ccgatcgccc 2580 
ttcccaacag ttgcgcagcc tgaatggcga atgagcttgc gccgtcccgt caagtcagcg 2640 
taatgctctg ccagtgttac aaccaattaa ccaattctga ttagaaaaac tcatcgagca 2700 
tcaaatgaaa ctgcaattta ttcatatcag gattatcaat accatatttt tgaaaaagcc .2760 
gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca agatcctggt 2820 
atcggtctgc gattccgact cgtccaacat caatacaacc tattaatttc ccctcgtcaa 2880 
aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggt gagaatggca 2940 
aaaggttatg catttctttc cagacttgtt caacaggcca gccattacgc tcgtcatcaa 3000 
aatcactcgc atcaaccaaa ccgttattca ttcgtgattg cgcctgagcg agacgaaata 3060 
cgcgatcgct gttaaaagga caattacaaa caggaatcga atgcaaccgg cgcaggaaca 3120 
ctgccagcgc atcaacaata ttttcacctg aatcaggata ttcttctaat acctggaatg 3180 
ctgttttccc agggatcgca gtggtgagta accatgcatc atcaggagta cggataaaat 3240 
gcttgatggt cggaagaggc ataaattccg tcagccagtt tagtctgacc atctcatctg 3'300 
taacatcatt ggcaacgcta cctttgccat gtttcagaaa caactctggc gcatcgggct 33€0 ■ 
tcccatacaa tcaatagatt gtcgcacctg attgcccgac attatcgcga gcccatttat 3420 
acccatataa atcagcatcc atgttggaat ttaatcgcgg cctcgacgag caagacgttt 3480 
cccgttgaat atggctcata acaccccttg tattactgtt tatgtaagca gacagtttta 3540 
ttgttcatga tgatatattt ttatcttgtg caatgtaaca tcagagattt tgagacactc 3600 
gacaagatga tcttcttgag atcgttttgg tctgcgcgta atctcttgct ctgaaaacga 3660 
aaaaaccgcc ttg6agggcg gtttttcgaa ggttctctga gctaccaact ctttgaaccg 3720 
aggtaactgg cttggaggag cgcagtcacc aaaacttgtc ctttcagttt agccttaacc 3780 
ggcgcatgac ttcaagacta actcctctaa atcaattacc agtggctgct gccagtggtg 3840 
cttttgcatg tctttccggg ttggactcaa gacgatagtt accggataag gcgcagcggt 3900 
cggactgaac ggggggttcg tgcatacagt ccagcttgga gcgaactgcc tacccggaac 3960 
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tgagtgtcag gcgtggaatg agacaaacgc ggccataaca gcggaatgac accggtaaac 4020 
cgaaaggcag gaacaggaga gcgcacgagg gagccgccag gggaaacgcc tggcatcttt 4080 
atagtcctgt cgggtttcgc caccactgat ttgagcgtca gatcccgtga tgcttgtcag 4140 

gggggcggag cctatggaaa aacggctttg ccgcggccct cteacttccc t 4191 

i 

<210> 84 • 
<211> 702 
<212> DNA 

<213> Bacillus subtiiis 

<220> 

<221> CDS 

<222> (1) . . (£99) 

<400> 84 

ttg tta ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 48 
Met Leu Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
5 10 15 

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca age agg 9t> 
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 
20 25 30 

eat aae aca gaa gat gag ttt ggg atg act ttg cgc tec tta ttt gat 14 4 
His Lys Thr Glu Asp Glu Phe tSly Met lie Leu Arg Ser Leu Phe Asp 
35 40 45 

cac tec ggg ctt atg ttt gaa cag ata gat ggc act att att teg tea 192 
His Ser Gly Leu Met Phe -Glu Gin He Asp Gly He He He Ser Ser 
50 55 SO 

gta gtg ccg cca ate atg ttt gcg tta gaa aga atg tgc aca aaa tae 240 
Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr .Lys Tyr 
65 70 75 80 

ttt cat ate gag cct eaa att gtt ggt eca ggt atg aaa aec ggt tta 288 
Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

aat ata aaa tat gac aat ccg aaa gaa gta ggg gea -gac aga ate gta 336 
Asn He Lys Tyr Asp Asn Pro Lys Glu Val "Gly Ala Asp Arg He Val 
100 105 110 

aat get gtc get gcg ata cac ttg tac ggc aat eca tta att gtt gtc 384 
Asn Ala Val Ala Ala lie His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

gat ttc gga acc gee aca acg tac tgc tat att gat gaa aac aaa eaa 4 32 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp <Glu Asn Lys -Gin 
130 135 140 

tac atg ggc ggg gcg att gee cct ggg att aca att teg aca gag gcg 4 80 
Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 ISO 
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ctt tac teg cgt gca gca aag ctt cct cgt ate gaa ate ace egg ccc 528 

Leu Tyr Ser Arg Ala Ala ,Lys Leu Pro Arg He Glu He Thr Arg Pro 
165 170 115 

gac aat att ate gga aaa aac act gtt age gcg atg caa tct gga att 576 

Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

tta ttt ggc tat gtc ggc caa gtg gaa gga ate gtt aag cga atg aaa 624 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 

195 200 205 

tgg cag gca aaa cag gac cca agg tea ttg cga cag gag gee tgg cgc 672 

Trp Gin Ala Lys Gin Asp Pro Arg Ser Leu Arg Gin Glu Ala Trp Arg 
210 215 220 

cgc tea ttg cga acg aat cag att gta tag 702 
Arg Ser Leu Arg Thr Asn Gin He Val 

225 230 • 



<210> 85- 
<211> 233 
<212> PRT 

<213> Bacillus subtilis 
<400> 85 

Met Leu Leu Val He Asp Val Gly Asn Thr Asn Tnr Val Leu Gly v a i 
1 5 10 L5 

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 
20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin He Asp Gly He He He Ser Ser 
50 55 60 

Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

Phe His lie Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 
100 105 HO 

Asn Ala Val Ala Ala lie His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr lie Asp Glu Asn Lys Gin 
130 135 140 

Tyr Met Gly Gly Ala lie Ala Pro Gly lie Thr He Ser Thr Glu Ala 
145 150 155 160 
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Leu Tyr Ser Arg Ala Ala Lys Leu *ro Arg lie Glu He Thr Arg Pro 
165 170 175 

Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly lie Val Lys Arg Met Lys 
195 200 205 

Trp Gin Ala Lys Gin Asp Pro Arg Ser Leu Arg -GlnXSlu Ala Trp Arg 
210 215 220 

Arg Ser Leu Arg Thr Asn Gin He Val 
225 230 

<210> 86 
<211> 1623 

<212> DNA . . 

<213> Bacillus subtilis 

<220> 1 
,<221> CDS 
<222> (1) . . (1620) 

<400> 86 

atg tat ttg gca ttc cag gtg caa aaa ttg atg egg tat ttg acg ctt 

Met Tyr Leu Ala Phe Gin Val Gin 'Lys Leu Met Arg Tyr Leu Thr Leu 
1 5 10 15 



48 



tac aag ata aag gac ctg aaa tta teg ttg ccc ggc acg aac aaa acg 96 
Tyr Lys He Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr 
20 . 25 . 30 

cag caa ttc atg gec caa gca gtc ggc cgt tta act gga aaa ccg gga 14 4 
Gin Gin Phe Met Ala <Sln Ala Val Gly Arg Leu Thr Gly Lys Pro Gly 
35 40 45 

gtc gtg tta gtc aca tea gga ccg ggt gec tct aac ttg gca aca ggc 
Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr -Gly 
50 55 €0 

ctg ctg aca gcg aac act gaa gga gac cct gtc gtt gcg ctt get gga 
Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly 
65 70 75 80 

aac gtg ate cgt gca tat cgt tta aaa egg aca cat caa tct ttg gat 
Asn Val He Arg Ala Tyr Arg Leu Lys Arg Thr His Gin Ser Leu Asp 
85 90 95 

aat gcg gcg eta ttc cag ccg att aca aaa tac agt gta gaa gtt caa 
Asn Ala Ala Leu Phe Gin Pro He Thr Lys Tyr Ser Val Glu Val Gin 
100 105 110 

gat gta aaa aat ata ccg gaa get gtt aca aat gca ttt agg ata gcg 
Asp Val Lys Asn He Pro Glu Ala Val Thr Asn Ala Phe Arg He Ala 
115 120 125 

tea gca ggg cag get ggg gee get ttt gtg age ttt ccg caa gat gtt 4 32 



192 



240 



288 



336 



384 
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Ser Ala Gly Gin Ala Gly Ala Ala Phe Val Ser Phe Pro Gin Asp Val 
130 135 140 

gtg aat gaa gtc aca aat acg aaa aac gtg cgt get gtt gca gcg cca 480 

Val Asn Glu Val Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro 

145 ' 150 155 160 

i 

aaa etc ggt cct gca gca gat gat gca ate agt gcg gee ata gca aaa 528 

Lys Leu Gly Pro Ala Ala Asp Asp Ala lie Ser Ala Ala He Ala Lys 
165 170 175 

ate caa aca gca aaa ctt cct gtc gtt ttg gtc ggc atg aaa ggc gga 576 

He Gin Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly 
180 185 190 

aga ccg gaa gca att aaa gcg gtt cgc aag ctt ttg aaa aag gtt cag 624 

Arg Pro Glu Ala He Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gin 
195 200 205 

ctt cca ttt gtt gaa aca tat caa get gee ggt acc ctt* tct aga gat 672 

Leu Pro Phe Val Glu Thr Tyr Gin Ala Ala Gly Thr Leu Ser Arg Asp 
210 215 220 

tta gag gat caa tat ttt ggc cgt ate ggt ttg ttc cgc aac cag cct 720 

Leu Glu Asp Gin Tyr Phe Gly Arg Tie Gly Leu Phe Arg Asn Gin Pro 

225 230 235 ' 240 

ggc gat tta ctg eta gag cag gca gat gtt gtt ctg acg ate ggc tat 768 

Gly Asp Leu Leu Leu Glu Gin Ala Asp Val Val Leu Thr He Gly Tyr 
245 250 255 

gac ccg att gaa tat gat ccg aaa ttc tgg aat ate aat gga gac egg 816 

Asp Pro He Glu Tyr Asp Pro Lys Phe Trp Asn He Asn Gly Asp Arg 
260 265 270 

aca att ate cat tta gac gag att ate get gac att gat cat get tac 864 

Thr He He His Leu Asp Glu lie He Ala Asp He Asp His Ala Tyr 
275 280 285 

cag cct gat ctt gaa ttg ate ggt gap att ccg tec acg ate aat cat 912 

Gin Pro Asp Leu Glu Leu He Gly Asp He Pro Ser Thr He Asn His 
290 295 300 

ate gaa cac gat get gtg aaa qtg gaa ttt gca gag cgt gag cag aaa 960 

He Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gin Lys 

305 310 315 . " 320 

ate ctt tct gat tta aaa caa tat atg cat gaa ggt gag cag gtg cct 1008 

lie Leu Ser Asp Leu Lys Gin Tyr Met His Glu Gly Glu Gin Val. Pro 
325 330 335 

gca gat tgg aaa tea gac aga gcg cac cct ctt gaa ate gtt aaa gag 1056 

Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu He Val Lys Glu 
340 345 350 

ttg cgt aat gca gtc gat gat cat gtt aca gta act tgc gat ate ggt 1104 

Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp He Gly 
355 360 365 
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tcg cac tec att tgg atg tea cgt tat ttc cgc age -tac gag ccg tta 1152 

Ser His Ser lie Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu 

370 375 380 

i 

aca tta atg ate agt aac ggt atg caa aca etc ggc gtt gcg ctt cet 1200 

Thr Leu Met He Ser Asn Gly Met Gin Thr Leu Gly Val Ala Leu Pro 

385 390 395 400 

tgg gca ate ggc get tea ttg gtg aaa ccg gga gaa aaa gtg gtt tet 124 8 
Trp Ala He Gly Ala Ser Leu Val Lys Pro Gly Glu Lys Val Val Ser 
4 05 410 415 

gtc tet ggt gac ggc ggt ttc tta ttc tea gca atg gaa tta gag aca 1296 
Val Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu <31u Thr 
420 425 430 

gca gtt cga eta aaa gca cca att gta cac att gta tgg ,aae gae age 134 4 
Ala Val Arg Leu Lys Ala Pro He Val His He Val Trp Asn Asp Ser 
435 440 445 

aca tat gac atg gtg cat ttc eag caa ttg aaa aaa tat aac egt aca 1392 
Thr Tyr Asp Met Val His Phe Gin Gin Leu Lys Lys Tyr Asn Arg Thr 
4 50 455 4 60 

tet gcg gtc gat ttc gga aat ate gat ate gtg aaa tat gcg gaa age 1440 
Ser Ala Val Asp Phe -Gly Asn He Asp He Val Lys Tyr Ala Glu Ser 
465 470 475 480 

ttc gga gca act gcg ttg cge gta gaa tea cca gac eag etg gca gat 1488 
Phe Giy Ala Thr Ala Leu Arg Val Glu Ser Pro Asp Gin Leu Ala Asp 
485 490 495 

gtt ctg cgt caa ggc atg aac get yaa ggt ect gte ate ate gat gtc 1536 
Val Leu Arg Gin Gly Met Asn Ala Glu -Gly Pro Val He He Asp Val 
500 505 510 

ccg gtt gac tac agt gat aac att aat tta gca agt gac aag ctt ccg 1584 
Pro Val Asp Tyr Ser Asp Asn He Asn' Leu Ala Ser Asp Lys Leu Pro 
515 520 525 

aaa gaa ttc ggg gaa etc atg aaa acg aaa get etc tag 1623 
Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu 
530 535 540 



<210> 87 
<211> 540 
<212> PRT 

<213> Bacillus subtilis 
<400> 87 

Met Tyr Leu Ala Phe Gin Val <51n Lys Leu Met Arg Tyr Leu Thr Leu 
15 10 15 

Tyr Lys He Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr 
20 25 30 
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Gin Gin Phe Met Ala Gin Ala Val Gly Arg Leu Thr Gly Lys Pro Gly 
35 40 45 

Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly 
50 55 60 

'Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly 
65 70 75 80 

Asn Val He Arg Ala Tyr Arg Leu Lys Arg Thr His Gin Ser' Leu Asp 
85 90 95 

Asn Ala Ala Leu Phe Gin Pro He Thr Lys Tyr Ser Val Glu Val Gin 
100 105 ' HO 

Asp Val Lys Asn He Pro Glu Ala Val Thr Asn Ala Phe Arg He Ala 
115 120 125 

Ser Ala Gly Gin Ala Gly Ala Ala Phe Val Ser Phe Pro Gin Asp Val 
130 135 140 

Val Asn Glu Val Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro 
145 150 155 160 

Lys Leu Gly Pro Ala Ala Asp Asp Ala He Ser Ala Ala He Ala Lys 
165 170 175 

lie Gin Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly 
180 185 ~ 190 

Arg Pro Glu Ala He Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gin 
195 200 205 

Leu Pro Phe Val Glu Thr Tyr Gin Ala Ala Gly Thr Leu Ser Arg Asp 
210 215 . 220 

Leu Glu Asp Gin Tyr Phe Gly Arg lie Gly Leu Phe Arg Asn Gin Pro 
225 230 235 240 

Gly Asp Leu Leu Leu Glu Gin Ala Asp Val Val Leu Thr He Gly Tyr 
245 250 255 

Asp Pro He Glu Tyr Asp Pro Lys Phe Trp Asn He Asn Gly Asp Arg 
260 265 270 

Thr He He His Leu Asp Glu He He Ala Asp He Asp His Ala Tyr 
275 280 285 

Gin Pro Asp Leu Glu Leu He Gly Asp He Pro Ser Thr He Asn His 
290 295 300 

He Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gin Lys 
305 310 315 320 



He Leu Ser Asp Leu Lys Gin Tyr Met His Glu Gly Glu Gin Val Pro 
325 330 335 

Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu lie Val Lys Glu 



» 
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340 345 350 

Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp lie Gly 
355 360 365 

Ser His Ser He Trp Met Ser Arg Tyr Phe Arg Ser Tyr <21u Pro Leu 
370 375 380 

Thr Leu Met He Ser Asn Gly Met Gin Thr Leu Gly Val Ala Leu Pro 
385 390 395 400 . 

Trp Ala He Gly Ala Ser Leu Val Lys Pro Gly Glu Lys Val Val Ser 
405- 410 415 

Val Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met <31u Leu Glu Thr 
420 425 430 

Ala Val Arg Leu Lys Ala Pro He Val His He Val Trp ( Asn Asp Ser 
435 440 445 

Thr Tyr Asp Met Val His Phe <Un Gin Leu Lys Lys Tyr Asn Arg Thr 
450 * 455 460 

Ser Ala Val Asp Phe Gly Asn He Asp lie Val <Lys Tyr Ala Glu Ser 
465 470 475 480 

Phe Gly Ala Thr Ala Leu Arg Val Glu Ser Pro Asp -Gin Leu Ala Asp 
485 490 495 

Vai Leu Arg Gin Gly Met Asn Ala -Glu Gly Pro Val He He Asp Val 
500 505 510 

Pro Vai Asp Tyr Ser Asp Asn He Asn Leu Ala Ser Asp Lys Leu Pro 
515 520 525 

Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu 
530 " 535 540 



<210> 88 
<211> 23 
<212> DNA 

<213> Artificial Sequence' 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<220> 

<223> All occurrences of n indicate any nucleotide 
<400> 88 

agaaaggagg tgannnnnnn atg 23 



<210> 89 
<211> 7 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

'<400> 89 

lie Arg Glu Met Glu Arg He 
1 5 



<210> 90 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

<400> 90 

He Arg Glu Arg Arg 
1 5 



<210> 9i 
, <211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> .Description of Artificial Sequence: PanC 
C terminus 

<400> 91 

He Arg Arg Lys Glu Val Asn 
1 5 

<210> 92 

<211> 6688 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN336 plasmid 

<400> 92 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgcacc aggcttctca 240 
ggcgctgact tagaaaacct cttgaatgaa gctgcgcttg tagcggctcg tcaaaacaag 300 
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aaaaaaatcg atgcgcgtga tattgacgaa gcgacggacc gt-gtaattgc cggacccgct 360 
aagaagagcc gcgttatctc caagaaagaa cgcaatatcg tggcttiatca cgaaggcgga 4 20 
cacaccgtta tcggtctcgt tttagatgag gcagatatgg ttcataaagt aacgattgtt 480 
cctcggggcc aggctggcgg ttatgctgtt atgctgccaa gagaagaccg ttatttccaa 540 
acaaagccgg agctgcttga taaaattgtc ggcctcttgg gcggacgtgt tgctgaagag "600 
attatcttcg gtgaagtcag cacaggggcg cacaatgact tccagcgt-gc gacgaatatt 660 
gcaagacgaa tggttacaga attcggtatg tcagaaaaac tgggaccgtt gcaatttgga 720 
cagtctcagg gcggt-caggt attcttaggc cgtgatttca acaacgaaca -gaactacagt 780 
gatcaaatcg cttacgaaat tgatcaggaa attcagcgca tcatcaaaga atgttatgag 840 
cgtgcgaaac aaatcctgac tgaaaatcgt gacaagcttg aattgattgc ccaaacgctt 900 
ctgaaagttg aaacgcttga cgctgaacaa atcaaacacc ttatcgatca tggaacatta 960 
cctgagcgta atttctcaga tgatgaaaag aacgatgatg tgaaagtaaa cat-tctgaca 1O20 
aaaacagaag aaaagaaaga cgatacgaaa gag-taattcg cttfrctttct aaaaaaactg 1O80 
ccggctgacg ctggcagttt tttitatgtaa atgattggct cagctgcggc ttttacaatc 1140 
atccaattct ggtatcgatt tgtttacaaa tgagccgctg atcgtgtatg gtattgtaga 1200 
atgtttgtaa aaagtaaagt agagaaacta tt<:aaaagtg gtgatagagg ttgttactgg. 1260 
ttatcgatgt ggggaacacc ctgcagctcg agtgaaatac cgcacagatg cgtaaggaga 1320 
aaataccgca tcaggcgata aacccagcga accatttgag gtgataggta agattatacc 1380 
gaggtatgaa aacgagaatt ggacctttac agaattactc tatgaagcgc -catat-ttaaa 1440 
aagctaccaa gacgaagagg atgaagagga tgaggaggca gattgccttg aatatattga 1500 
caatactgat aagataatat abcttttata tagaagatat .cgccgta-tgt aaggatttca 1560 
gggggcaagg cataggcagc gcgcttatca atatatctat agaatgggca aagcataaaa 1620 
acttgcatgg actaatgctt gaaacccagg acaataacct tatagcttgt aaattctatc 1680 
ataattgtgg tttcaaaatc ggcfcccgtcg atactatgtt atacgccaac tttcaaaaca .1740 
actttgaaaa agctgttttc tggtatttaa ggttttagaa tgcaaggaac agtgaattgg 1800 
agttcgtctt gttataatta gcttxittggg gtatctttaa atactgtaga aaagaggaag 1660 
gaaataataa atggctaaaa tgagaatatc accggaattg aaaaaactga -tcgaaaaata 1920 
ccgctgcgta aaagatacgg aaggaatgtc tcctgctaag gtatataagc tggtgggaga 1980 
aaatgaaaac ctatatttaa aaatgacgga cagccggtat aaagggacca cctatgafcgt 2040 
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ggaacgggaa aaggacatga tgctatggct ggaaggaaag ctgcctgttc caaaggtcct 2100 
gcactttgaa -cggcatgatg gctggagcaa tctgctcatg agtgaggccg atggcgtcct 2160 
ttgctcggaa gagtatgaag atgaacaaag ccctgaaaag attatcgagc tgtatgcgga 2220 
-gtgcatcagg ctctttcact ccatcgacat atcggattgt ccctatacga atagcttaga 2280 
cagccgctta gccgaattgg attacttact gaataacgat ctggccgatg tggattgcga 234 0 
aaactgggaa gaagacactc catttaaaga tccgcgcgag ctgtatgatt ttttaaagac 2400, 
ggaaaagccc gaagaggaac ttgtcttttc ccacggcgac ctgggagaca gcaacatctt 2460 
tgtgaaagat ggcaaagtaa gtggctttat tgatcttggg agaagcggca gggcggacaa 2520 
gtggtatgac attgccttct gcgtccggtc gatcagggag gatatcgggg aagaacagta 2580 
tgtcgagcta ttttttgact tactggggat caagcctgat tgggagaaaa taaaatatta 2640 
tattttactg gatgaattgt tttagtacct agatttagat gtctaaaaag ctttaactac 2700 
aagcttttta gacatctaat cttttctgaa gtacatccgc aactgtccat actctgatgt 2760 
tttatatctt ttctaaaagt tcgctagata ggggtccega gcgcctacga ggaatttgta 2820 
tcgccattcg ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc ggtcgactgg 2880 
caggcaaaac aggacccaag gtcattgcga caggaggcct ggcgccgctc attgcgaacg 2940 
aatcagattg tatagacatc gttgatccat tcttaaccct aaaagggctg gaattgattt- 3000 
atgaaagaaa ccgcgtagga agtgtatagg aggtttagta atggattatt tagtaaaagc 3060 
acttgcgtat gacggaaaag ttcgggctta tgcagcgaga acgactgata tggtaaatga 3120 
ggggcagaga cgccatggta cgtggccgac agcatccgct gcactaggcc gtacaatgac 3180 
agcttcactt atgctcggcg ctatgctgaa gggcgatgat aagctgaccg tgaaaatcga 3240 
gggcggaggt ccgatcggag ctattgtagc tgatgccaat gccaaaggag aagtcagagc 3300 
ctatgtctct aacccgcaag ttcattttga tttaaatgaa caaggtaagc ttgatgtcag 3360 
acgtgcggtt ggaacaaacg gaacgttaag tgtcgtaaaa gatttaggtt tgcgcgagtt 3420 
cttcacagga caagtagaaa tcgtttcagg agaattagga gatgatttta cttactatct 3480 
tgtgtcatct gagcaggttc cttcatcagt gggcgtaggt gtgctcgtaa atcctgacaa 354 0 
taccattctt gcggcagggg gctttattat tcagctgatg ccgggaacag atgatgaaac 3600 
aatcacaaaa attgaacagc gtctatctca agtagagccg atttctaagc tcatccaaaa 3660 
agggctgaca ccagaagaaa ttttagaaga agtcctaggc gagaaacctg agattttgga 3720 
aacgatgcct gtcagattcc attgcccttg ttcaaaagaa cggttcgaaa cagccatttt 3780 



WO 01/21772 PCT/U500/25993 

• 138- 

aggactaggc aaaaaagaaa ttcaagatat gatagaagaa ^atgga<^aag ccgaagcagt 3840 
atgccatttt tgtaatgaaa agtacttatt tacaaaa^aa gagctggaag g4ctt<:gtga 3900 
ccaaactacc cgctaagctc tttagcgggt ttttaatttg agaaaagggg ^tgaaagcag 3960 
gtttgaaatc aagaacaatc tggacgcgtt ggatgcatag cttgagtatt ctatagtgfcc 4020 
acctaaatag cttggcgtaa tcatggtcat agctgtttcc tgtgtgaaat tgttat<x:gc 4080 
tcacaattcc acacaacata cgagccggaa gcataaagtg taaagcctgg ggtgcctaat 4140 
gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc 4200 
tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg 4260 
ggcgctcttc cgcttcctcg ctcactgact <:gctgcgcfcc ggtx:gttcgg -ctgcggcgag 4 320 
cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag 4 380 
'gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 4 4 40 
tggcgttttt cgataggctc cgcccccctg acgagcatca caaaaat-cga cgctcaagtc 4500 
agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ^ggaagctccc 4 560 
tcgtgcgctc tcctgttccg accctgccgc ttaccggata -cctgtccgcc -tttctccccc-' 4620 
cgggaagcgt ggcgctttct -catagctcac gctgtaggta t«ctcagttcg gtgtaggtcg 4-680 
ttcgctccaa gctgggctgt gt<gcacgaac cccccgttca gcccgaccgc tgcgccttafe 4740 
ccggtaacta tcgtcttgag tocaacccgg taagacacga cttatcgcca ctggcagcag 1800 
ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt 4-860 
ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc 4 920 
cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta 4 980 
gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga <tctcaagaag 504O 
atcctttgat cttttctacg gggtctgacg cfccagtggsa cgaaaactca cgttaaggga 5100 
ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaat«gaa 5160 
gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa 5220 
tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc 5280 
ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga 5340 
taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag -ccagccggaa 5400 
gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct at-taattgtt 5460 
gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttggcattg 5520 
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ctacaggcat 


cgtggtgtca 


cgctcgtcgt 


ttggtatggc 


^ 't - Si +" ^ /■» a <r /*• 4- t*f*nr*^- +■ ******* 

LLLdu 1 cdyc tccggtcccc 


ccon 
DOoU 


aacgat caag 


acaaattaca 


tgatccccca 


tgttgtgcaa 


aaaageggre agctcctticg 


c c a r\ 
ob4 Q 


gt cctccgat 


co t t a t c a era 


agtaagttgg ccgcagtgtt 


atcacccatg gctatggcag 


□ 700 


cactgcataa 


ttctcttact 


gtcatgecat 


ccgtaagatg 


ctLtLCLgty aciggcgagt 


D /bu 


ar*t" PAacpas 




gaataccgcg cccggcgacc 


gagttgetet tgcccggcgt 


5820 


cast* afnncrp 

w o a t a v_,y y y a 


LaatayLy la 


tgacatagca 


gaactttaaa 


agtgctcatc attggaaaac 


5880 


y l. u l* l. l L-y y y 


yuyddddL ll 


tcaaggatct 


taccactatt 


gagatccagt tcgatgtaac 


5940 




dCCCddCLyd 


tct tcagcat 


cttttacttt 


caccagcgtt tctgggtgag 


6000 


r-> -3 -3 a .a .a a /"i ™ 
LodaaoLd(jy 


ddyyCdddal 


geegcaaaaa 


aaaaaataaa 


ggegacaegg aaatgttgaa 


6060 


LaCLCdLaCt 


CUtCCLLLUL 


caatattat t 




tcagggttat tgtctcatga 


6120 


a a a ^ +~ ^ ^« ^ +- 

gcgydiaCdL, 


dLtLydaLyi 


atttagaaaa 


ataaacaaat 


aggggttccg cgcacatttc 


6180 


LLLyddadyL 


gCCdCCLgLd 


tgcggtgtga 


aataccgcac 


agatgegtaa ggagaaaata 


6240 


CCyLdLCdyy 


CydadLLyLd 


aacgttaata 


ttttgttaaa 


attcgegtta aatatttgtt 


6300 


add s. LaytLL 


dLLUUlLddv. 


caataggecg 


aaatcggcaa 


aatcccttat aaatcaaaag.*' 


6360 


ad LdyduLyd 


g a tagggc eg 


agtgttgttc 


cagtttggaa 


caagagtcca ctattaaaga 


6420 


acgtggactc 


caaegtcaaa 


gggcgaaaaa 


ccgtctatca 


gggcgatggc ccactacgtg, 6480 


aaccatcacc 


caaatcaagt 


tttttgcggt 


cgaggtgccg 


taaagctcta aateggaace 


6540 


ctaaagggag 


cccccgattt 


agagcttgac 


ggggaaagee 


ggcgaacgtg gcgagaaagg ^600 


aagggaagaa 


agegaaagga 


gegggegcta 


gggcgctggc 


aagtgtagcg gtcacgctgc 


6660 


gcgtaaccac 


cacacccgcc 


gcgcttaa 






6688 



<210> 93 
<211> 8503 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN004 plasmid 

<400> 93 

gaattttgcg gccgcttcga aagctgtaat ataaaaacct tcttcaacta aeggggcagg 60 

ttagtgacat tagaaaaccg actgtaaaaa gtacagtegg cattatctca tattataaaa 120 

gecagtcatt aggectatet gacaattcct gaatagagtt cataaacaat ectgeatgat 180 

aaccatcaca aacagaatga tgtacctgta aagatagegg taaatatatt gaattacctt 240 
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tattaatgaa ttttcctgct gtaataatgg gtagaaggta attactatta ttattgatat 300 
ttaagttaaa cccagtaaat gaagtccatg gaataataga aagagaaaaa gcattttcag 3-60 
gtataggtgt tttgggaaac aatttccccg aaccattata tttctctaca £cagaaaggt 420 
ataaatcata aaactctttg aagtcattct ttacaggagt -ccaaatacca gagaatgttt 480 
tagatacacc atcaaaaatt gtataaagtg gctctaactt atcccaataa -cctaactctc 540 
cgtcgctatt gtaaccagtt ctaaaagctg tatttgagtt ^tatcaccctt gtcactaaga *0O 
aaataaatgc agggtaaaat ttatatcctt cttgttttat gtttcggtat aaaacactaa "660 
tatcaatttc tgtggttata ctaaaagtcg -tttgttggtt caaataatga t-taaatatct ~*20 
cttttctctt ccaattgtct aaatcaattt tattaaagtt catttgatat gcctcctaaa 780 
tttttatcta aagtgaattt aggaggctta cttgtctgct ttcttcatta gaatcaatcc 840 
ttttttaaaa gtcaatatta ctgtaacata aatatatatt ttaaaaatat cccactttat 900 
ccaattttcg tttgttgaac taatgggtgc tttagttgaa gaataaagac caca-ttaaaa 960 
, aatgtggtct tttgtgtttt tttaaaggat ttgagcgtag cgaaaaatcc ttttctttct 1020 
tatcttgata ataagggtaa ctattgaatt cggtaccaag agtttgtaga aacgcaaaaa 1080 
ggccatccgt caggatggcc ttctgcttaa tttgatgcct ggcagtttat ggcgggcgtc 1140 
ctgcccgcca ccctccgggc cgttgct^cg caacgt^caa atccgctccc ggcggatttq; 12O0 
tcctactcag gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctttcg^ 1260 
actgagcctt tcgttttatt tgatgcctgg <:agtt<:ccta ctctcgcatg gggagacccc 1320 
acactaccat cggcgctacg gcgtttcact *ctgagttcg gcatggggtc aggtgggacc 1380 
accgcgctac tgccgccagg caaattctgt tttatcagac cgcttctgcg ttctgattta 14 40 
atctgtatca ggctgaaaat cttctctcat ccgccaaaao aggatcctac ggaaatggag 1500 
cggcaaaacc gttttactct caaaatctta aaagaaaacc cccgataaag ggggcttttc 1560 
ttctacaaaa ttgtacgggc tggttxrgttc cccagcattt gttcaatttt gttttgatoa 1*620 
ttcagaacag ccacttOcgg ctcatggctt gccgcttctt gatcagacat <:attttgtag 1680 
gaaataataa tgaccttatc tccttcctgc acaaggcgtg cggctgcacc gtttaagcat 1740 
atgacgccgc ttccccgttt accaggaata atatacgttt caagacgtgc tccattatta 1800 
ttattcacaa tttgtacttt ttcattagga agcattccca cagcatcaat gagat<:ttca 18*0 
tcaattgtaa tgcttcccac atagttcagg tttgcttccg taacagttgc cctgtgaagt 1920 
ttgccgctca tcattgttcg atacatatta tattctctcc atttctcgaa tatcaataat 1980 
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gatattatct attaaacgcg cttttgaaaa agcaactgca acagcgagaa tcatctttcc 2040 
agcaatttca ttcacaggct cgagttccgg ataggaataa agctctacat agtctatggt 2100 
tccgctagtc gtttcaatga tatcttttgc agcttttatc accgcttcag gatctctttc 2160 
accggcttgg acaagttccg cacttgtttg aagggcccga tacagcttag gcgcttcttt 2220 
tctttcctca gctgttaagt atacattgcg agagcttttg gctaagccgt cttcctctct 2280 
gaccgtatcg acaggaacca attcaatatc catgaagaag tcgctgatta- acccatcaac 2340 
aacagctacc tgctgcgcat cttttaaacc gaaataggca cgagtcggct tgactagatt 2400 
gaaaagcttc gtcagtacga tcgcgacccc gtcaaaatgt ccttctcttg agcgcccgca 24 60 
taacacgtct gtgcgtcttt ctacatgaat cgtgacattc ttttcaccgg gatacatatc 2520 
atgagcatct ggcgtaaaaa gaatatcgac tccggcgttt tctgcaafgag ctgcatcccg 2580 
ctcaatatcg cgcggatatg cttcaaaatc ttcattaggg ccgaattgtg caggattcac 2640 
aaaaatactc ataataacgg cgtcgttttc ttgtcttgct ttgtctgcta aggttaaatg 2700 
cccctcatgc agaaacccca tcgtcggaac aaatccgatt gacttgccct ctgaatggta 2760 
ttgttttatg gcttctttca gctgtgaaat atcagtaatc tgtctcatct tattttcccc 2820 
cgtacaagcc gtcaagcact gtctggttca tttgaaagga atgcttttgt tcagggaaag 2880 
cacgatgtct tacatcctga acatatccgc tgattgctgt ttcgatggtt tcatcaatgc- 2940 
gcgtatattg ctttacaaat ttaggtgttc tctcaacacc gtggccgata atatcatgat. 3000 
aaacgagaac ttgtccgtcc gctttcacac cagccccgat tccaatgacc ggtatgctta 3060 
gcgtctcggc aattttggct gtgagttctg ccggcacaca ttccagcaca agcatcatag 3120 
ctcctgcttc ttcgcatttt atactgtctt ctattaattt tttggcgctt tgttcgtctt 3180 
tgccctgtac tttatagccg cccagtacgc cgactgactg cggtgtcaaa cctaagtgac 3240 
tgactactgg aatgcctcca agcgtcaatg cgcgaatgga ttcaaacacg ccttctccgc 3300 
cctcaagctt cagtgcgtca gctccgcttt cctgaacgat agccgctgca tttttcagcg 3360 
tatcttcctt agacaggtga taagacataa acggcatatc tgtcacaata aaggtattcg 3420 
gcgcacccct tttaacggct tttgtatgat ggatcatgtc cgcaactgtc acaccgacag 3480 
ttgaatcaag gccgaggacg accattccaa gtgaatcacc gactaaaatc atgtcaactc 354 0 
ccgcttgttc agcaagttta gctgccggat aatcataagc ggtcagcatg acaatcggtt 3600 
cttcagactc cttcattttt agaaaatcca gttttgtttt catgttttct cctccttcta 3660 
gagcgtcctg ctgttgttaa gattattata ccacaccttg tagataaagt caacaacttt 3720 
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ttgcaaaatt tttcaggaat tttagcagag gttgttctgg atgtagaaca aaacatcttt 3780 
ccgctcttgt gctgttagga tatctttctt ggaagctagg taggcctcga gttatggcag 3840 
ttggttaaaa ggaaacaaaa agaccgtttt cacacaaaac ggtctttttc gatttctttt 3900 
tacagtcaca gccacttttg caaaaaccgg acagcttcat gccttataac fcgctgtt*cg 3960 
gtcgacaagc ttcgcgaagc ggccgcaaaa ttcactggcc gtcgttttac aacgtcgtga 4020 
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc -cttt<:gccag 4080 
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc <:aacagttgc gcagcctgaa 4140 
tggcgaatgg ^cgcctgatgc ggtattttct ccttacgcat ctgtgcggta t£tca*caccg 4200 
catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca 42-60 
cccgccaaca cccgctgact atgcttgtaa accgttttgt gaaaaaattt ttaaaataaa 4 320 
•aaaggggacc tctagggtcc ccaattaatt agtaatataa -tctattaaag gtcattcaaa 4380 
aggtcatcca ccggatcagc ttagtaaagc cctcgctaga ttttaatgcg gatgttgcga 4440 
ttacttcgcc aactattgcg ataacaagaa aaagccagcc tttcatgata tatctcccaa 4500 
tttgtgtagg gcttattatg cacgcttaaa aataataaaa qcagacttga cctgatagtt 4 560 
tggctgtgag caattatgtg cttagtgcat ctaacgct-tg agttaagccg cgccgcgaag 4620 
cggcgtcggc ttgaacgaat tgttagacat tatttgccga -ctaccttggt gatctcgoct 4*680 
ttcacgtagt ggacaaattc ttccaactga tctgcgcgcg aggccaagcg atcttcttct. 4740 
tgtccaagat aagcctgtct agcttcaagt atgacgggct gatactgggc cggcaggcgc 4800 
tccattgccc agtcggcagc gacatccttc ggcgcgattt tgccggttac tgcgotgtac 48'60 
caaatgcggg acaacgtaag cactacattt cgctcatcgc .cagcocagtc gggcggcgag 4920 
ttccatagcg ttaaggtttc atttagcgcc tcaaatagat cctgttcagg aaccggatca 4980 
aagagttcct ccgccgctgg acctaccaag gcaacgctat gttctcttgc ttttgtcagc 5040 
aagatagcca gatcaatgtc gatcgtggct ggctcgaaga tacctgcaag aatgtcattg 51*00 
cgctgccatt ctccaaattg cagttcgcgc ttagctggat aacgccacgg aatgatgtcg 5160 
tcgtgcacaa caatggtgac ttctacagcg cggagaatct cgctctctcc aggggaagcc 5220 
gaagtttcca aaaggtcgtt gatcaaagct cgccgcgttg tttcatcaag ccttacggt<: 5280 
accgtaacca gcaaatcaat atcactgtgt ggcttcaggc cgccat<:cac tgcggagccg 5340 
tacaaatgta cggccagcaa -cgtcggttcg agatggcgct cgatgacgcc aactacct-ct 5400 
gatagttgag tcgatacttc ggcgatcacc gcttccctca tgatgtttaa ctttgtttta 5460 
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gggcgactgc cctgctgcgt aacatcgttg 
gcgtaacgcg cttgctgctt ggatgcccga 
taacaagcca tgaaaaccgc cactgcgccg 
gaccagttgc gtgagcgcat acgctacttg 
gtccactggg ttcgtgcctt catccgtttc 
agcagcgaag tcgaggcatt tctgtcctgg 
acgcatcgtc aggcattggc ggccttgctg 
ctgccctggc ttcaggagat cggaagacct 
accccggatg aagtggttcg catcctcggt 
cagcttctgt atggaacggg catgcggatc 
ctggatttcg atcacggcac gatcatcgtg 
ttgatgttac ccgagagctt ggcacccagc 
tgaccttttg aatgaccttt aatagattat 
ccctttttta ttttaaaaat tt;tttcacaa 
cccgcaaacg ggctgttctg gtgttgctag 
ggtttgccgg ctgaaagcgc tatttcttcc 
gcgtcactgg ctcccgtgtt gtcggcagct 
ggctgtctat gtgtgactgt tgagctgtaa 
ctagttgctt tgttttactg gtttcacctg 
ttacattgtc gatctgttca tggtgaacag 
ctgatgtatc tatctttttt acaccgtttt 
atatctaacg gtgaacagtt gttctacttt 
atacaagagc cataagaacc tcagatcctt 
gttcgttgtt tttgcgtgag ccatgagaac 
tcactcaaaa attttgcctc aaaactggtg 
agtgtttttc ttagtccgtt acgtaggtag 
tcaccattca tttttatctg gttgttctca 
agttcaactt ggaaaatcaa cgtatcagtc 
atattgctgt aagtgtttaa atctttactt 
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ctgctccata 


acatcaaaca tcgacccacg 5520 


ggcatagact 


gtaccccaaa 


aaaacagtca 


5580 


ttaccaccgc 


tgcgttcggt 


caaggttctg 5640 

♦ 


cattacagct 


tacgaaccga 


acaggcttat 


5700 


cacggtgtgc 


gtcacccggc 


aaccttgggc 


5760 


ctggcgaacg 


agcgcaaggt 


ttcggtctcc 


5820 


ttcttctacg 


gcaaggtgct 


gtgcacggat 


5880 


cggccgtcgc 


ggcgcttgcc 


ggtggtgctg 


5940 


tttctggaag 


gcgagcatcg 


tttgttcgcc 


6000 


agtgagggtt 


tgcaactgcg 


ggtcaaggat 


6060 


cgggagggca 


agggctccaa 


ggatcgggcc 


6120 


ctgcgcgagc 


aggggaattg 


atccggtgga 


6180 


attactaatt 


aattggggac 


cctagaggtc 


6240 


aacggtttac 


aagcataacg 


ggttttgctg 


6300 


tttgttatca 


gaatcgcaga 


tccggcttca 


6360 


agaattgcca 


tgattttttc 


cccacgggag; 6420 


ttgattcgat 


aagcagcatc 


gcctgtttca.. -6480 


caagttgtct 


caggtgttca 


atttcatgtt 


6540 


ttctattagg 


tgttacatgc 


tgttcatctg 


6600 


ctttaaatgc 


accaaaaact 


cgtaaaagct 


6660 


catctgtgca 


tatggacagt 


tttccctttg 


6720 


tgtttgttag 


tcttgatgct 


tcactgatag 


6780 


ccgtatttag 


ccagtatgtt 


ctctagtgtg 


6840 


gaaccattga 


gatcatgctt 


actttgcatg 


6900 


agctgaattt 


ttgcagttaa 


agcatcgtgt 


6960 


gaatctgatg 


taatggttgt 


tggtattttg 


7020 


agttcggtta 


cgagatccat 


ttgtctatct 


7080 


gggcggcctc 


gcttatcaac 


caccaatttc 


7140 


attggtttca 


aaacccattg 


gttaagcctt 


7200 
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ttaaactcat ggtagttatt ttcaagcatt aacatgaact *aaattcatc aaggctaatc 72-60 
tctatatttg ccttgtgagt tttcttttgt gttagttctt ttaataacca ctcataaatc 7320 
ctcatagagt atttgttttc aaaagactta acatgttcca gattatattt tatgaatttt 7380 
tttaactgga aaagataagg caatatctct tcactaaaaa ctaattctaa tttttcgctt 7 4 40 
gagaacttgg catagtttgt ccactggaaa atctcaaagc ctttaaccaa aggattcctg 7 500 
atttccacag ttctcgtcat cagctctctg gttgctttag ctaatacacc ataagcattt 7 560 
tccctactga tgttcatcat ctgagcgtat tggttataag tgaacgatac -cgt-ccgttct 7620 
ttccttgtag ggttttcaat cgtggggttg agtagtgcca <cacagcataa aattagcttg 7*680 
gtttcatgct ccgttaagtc atagcgacta atcgctagtt -catttgcttt gaaaacaact 77 40 
aattcagaca tacatctcaa ttggtctagg tgattttaat -cactatacca at*tgagatgg 7800 
gctagtcaat gataattact agtcctttOc ctttgagttg tgggtatctg taaat<x:tgc 78*0 
tagacctttg ctggaaaact tgtaaattct gctagaccct ctgtaaattc cgctagacet 7920 
ttgtgtgttt tttttgttta tattcaagtg gttataattt atagaataaa gaaaqaataa 7980 
aaaaagataa aaagaataga tcccagccct gtgtataact cactacttta gtcaqttccg 8040 
cagtattaca aaaggatgtc gcaaacgctg tttgct-cctc tacaaaacag accttaaaac 31O0 
cctaaaggct taagtagcac >cctcgcaagc tcgggcaaat cgctgaatat tecttttgtc 31-60 
tccgaccatc aggcacctga gtcgotgtct ttttcgtgac attcagttcg ctgcgctcao 8220 
ggctctggca gtgaatgggg gtaaatggca ctacaggcgc cttttatgga ttcatgcaag 8280 
gaaactaccc ataatacaag aaaagcccgt cacgggcttc tcagggcgtt ttatggcggg 8340 
tctgctatgt ggtgctatct gactttttgc tgttcagcag ttcctgccct ctgattttcc 8400 
agtctgacca cttcggatta tcccgtgaca ggt<cattcag actggctaat gcacccagta 84*0 
aggcagcggt atcatcaaca ggcttacccg tcttactgtc aac 8503 

<210> 94 
<211> 7381 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
pAN006 plasmid 

<400> 94 

ttgcggccgc ttcgaaagct gtaatataaa aaccttxrttc aactaacggg gcaggttagt 60 
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vjacaL Layaci 




dddclciyUclCa 


gtcggcatta 


cctcatatta 


taaaagccag t 


120 




L a LCUy aLad 




gagttcataa 


acaatcctgc 


atgataacca 


180 




aatyatytaC 


cugtaaagat 


agcggtaaat 


atattgaatt 


acctttatta 


240 


atydaLlLLC 


CLyCLytddL 


aatgggtaga 


aggtaattac 


tattattatt 


gatatttaag 


300 


^ ^ ^ ^3 r+ n /r 

LLdddCCCdg 


caaa cgaagt 


ccatggaata 


atagaaagag 


aaaaagca.tt 


ttcaggtata 


360 


ggtgt c utgg 


gaaacaattt 


ccccgaacca 


ttatatttct 


ctacatcaga 


aaggtataaa 


420 


tcataaaact 


ctttgaagt c 


attctttaca 


ggagtccaaa 


taccagagaa 


tgttttagat 


480 


acaccatcaa 


aaattgtata 


aagtggctct 


aacttatccc 


aataacctaa 


ctctccgtcg 


540 


ctattgtaac 


cagttctaaa 


agctgtattt 


gagtttatca 


cccttgtcac 


taagaaaata 


600 


aatgcagggt 


aaaatttata 


tccttcttgt 


tttatgtttc 


ggtataaaac 


actaatatca 


660 


atttctgtgg 


ttatactaaa 


agtcgtttgt 


tggttcaaat 


aatgattaaa 


tatctctttt 


720 


ctcttccaat 


tgtctaaatc 


aattttatta 


aagttcattt 


gatatgcctc 


ctaaattttt 


780 


atctaaagtg 


aatttaggag 


gcttact tgt 


ctgctttctt 


cattagaatc 


aatccttttt 


840 


taaaagtcaa 


tattactgta 


acataaatat 


atattttaaa 


aatatcccac 


tttatccaat 


900 


t ttcgtttgt 


tgaactaatg 


ggtgctttag 


ttgaagaata 


aagaccacat 


taaaaaatgt 


960 


ggtcttttgt 


gtttttttaa 


aggatttgag 


cgtagcgaaa 


aatccttttc 


tttcttatct 


1020 


tgataataag 


ggtaactatt 


gaattcggta 


ccaagagttt 


gtagaaacgc 


aaaaaggcca 


1080 


tccgtcagga 


tggccttctg 


cttaatttga 


tgcctggcag 


tttatggcgg 


gcgtcctgcc 


1140 


cgccaccctc 


cgggccgttg 


cttcgcaacg 


ttcaaatccg 


ctcccggcgg 


atttgtccta 


1200 


ctcaggagag 


cgttcaccga 


caaacaacag 


ataaaacgaa 


aggcccagtc 


tttcgactga 


1260 


gcctttcgtt 


t tatttgatg 


cctggcagtt 


ccctactctc 


gcatggggag 


accccacact 


1320 


accaccggcg 


CLdcggcgtL 


tcacttctga 


gttcggcatg 


gggtcaggtg 


ggaccaccgc 


1380 


#T /-« 4- a nc*r+c\ 

yctactyccg 


LCdyyCdddu 


■*-/-*-/-i*-+-+-4-^if- 

cctgLtLtat 


cagaccgctt 


ctgcgttctg 


atttaatctg 


1440 




ddaaLCttCt 


ctcatccgcc 


aaaacaggat 


cctacggaaa 


tggagcggca 


1500 


aaaccgtttt 


actctcaaaa 


tcttaaaaga 


aaacccccga 


taaagggggc 


ttttcttcta 


1560 


caaaattgta 


cgggctggtt 


cgttccccag 


catttgttca 


attttgtttt 


gatcattcag 


1620 


aacagccact 


ttcggctcat 


ggcttgccgc 


ttcttgatca 


gacatcattt 


tgtaggaaat 


1680 


aataatgacc 


ttatctcctt 


cctgcacaag 


gcgtgcggct 


gcaccgttta 


agcatatgac 


1740 


gccgcttccc 


cgtttaccag 


gaataatata 


cgtttcaaga 


cgtgctccat 


tattattatt 


1800 
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cacaatttgt acttttt-cat taggaagcat toccacagca tcaatgagat cttcatcaat 1860 
tgtaatgctt cccacatagt tcaggtttgc *tccgtaaca gttgccctgt gaagtttgcc 1920 
gctcatcatt gttcgataca tattatattc tctccatttc tcgaatatca ataatgatat 1980 
tatctattaa acgcgctttt gaaaaagcaa ctgcaacagc gagaatcatc tttccagcaa 2040 
tttcattcac aggctcgagt tccggatagg aataaagctc tacatagt<:t atggttccgc 2100 , 
tagtcgtttc aatgatatct tttgcagctt ttatcaocgc ttcaggatct ctttcaccgg 2160 
cttggacaag ttccgcactt gtttgaaggg cccgatacag cttaggcgct -tctttt-cttt 2220 
cctcagctgt taagtataca ttgcgagagc ttttggctaa gccgtcttcc tctctgaccg 2280 
tatcgacagg aaccaattca atatccatga agaagtcgct gattaaccea tcaacaacag 2340 
ctacctgctg cgcatctttt aaaccgaaat aggcacgagt -cggcttgact agattgaaaa 2400 
gcttcgtcag tacgatcgcg accccgtcaa aatgtccttc tcttga^cgc ccgcataaca 24-60 
cgtctgtgcg tctttctaca tgaat-cgtga -cattcttttc accgggatac atatcatgag 2520 
catctggcgt aaaaagaata tcgactccgg cgttttctgc aagagctgca t<:ccgctcaa 2580 
tatcgcgcgg atatgcttca aaatctticat' tagggccgaa ttgtgcagga ttoacaaaaa* 2640 
tactcataat aacggcgtcg ttttcttgtc ttgctttgtc tgctaaggtt aaatgcccct 2700 
catgcagaaa ccccatcgtc ggaacaaatc -cgattgactt gccctctgaa tggtatcgtt: 2760 
ttatggcttc tttcagctgt gaaatat<;ag Haatctgtct catctvtattt tcocccgtac 2820 
aagccgtcaa gcactgtctg gttcatttga aaggaatgct tttgttcagg gaaagcacga 2880 
tgtcttacat cctgaacata tccgctgatt gctgtt*cga <ggtttcatc aatgcgcgta 2940 
tattgcttta caaatttagg tgttctctca acaccgtggc cgataatatc atgataaacg 3000 
agaacttgtc cgtccgcttt cacaccagcc ccgattccaa tgaccggtat .gcttagcgtc 3060 
tcggcaattt tggctgtgag ttctgccggc acacattcca gcacaagcat catagctcct 3120 
gcttcttcgc attttatact gtcttctatt aattttttgg cgctttgttc gtctttgccc 3180 
tgtactttat agccgcccag tacgccgact gactgcggtg tcaaacotaa gtgactgact 3240 
actggaatgc ctccaagcgt caatgcgcga atggattcaa acacgccttc tccgccctca 3300 
agcttcagtg cgtcagctcc gctttcctga acgatagccg ctgcattttt cagcgtatct 3360 
tccttagaca ggtgataaga cataaacggc atatctgtca caataaaggt attcggcgca 3420 
ccccttttaa cggcttttgt atgatggatc atgtccgcaa ctgtcacacc gacagttgaa 3480 
tcaaggccga ggacgaccat tccaagtgaa tcaecgacta aaatcatgtc aactcccgct 3540 
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tgt tcagcaa 


gtttagctgc 


cggataatca 


taagcggtca 


gcatgacaat 


cggttcttca 


3600 


gactccttca 


tttttagaaa 


atccagtttt 


gttttcatgt 


tttctcctcc 


tctagagcgt 


3660 


cctgctgttg 


ttaagat;tat 


tataccacac 


cttgtagata 


aagtcaacaa 


ctttttgcaa 


3720 


aatttttcag 


gaattttagc 


agaggttgtt 


ctggatgtag 


aacaaaacat 


ctttccgctc 


3780 


ttgtgctgtt 


aggatatctt 


tcttggaagc 


taggtaggcc 


tcgagtta'tg 


gcagttggtt 


3840 


aaaaggaaac 


aaaaagaccg 


ttttcacaca 


aaacggtctt 


tttcgatttc 


tttttacagt 


3900 


cacagccact 


tttgcaaaaa 


ccggacagct 


tcatgcctta 


taactgctgt 


ttcggtcgac 


3960 


ctgcaggcat 


gcaagcttcg 


cgaagcggcc 


gccgacgcga 


ggctggatgg 


ccttccccat 


4020 


tatgattctc 


ctcgcttccg 


gcggcatcgg 


gatgcccgcg 


ttgcaggcca 


tgctgtccag 


4080 


gcaggtagat 


gacgaccatc 


agggacagct 


tcaaggatcg 


ctcgcgqctc 


ttaccagcct 


4140 


aacttcgatc 


actggaccgc 


tgatcgtcac 


ggcgatttat 


gccgcctcgg 


cgagcacatg 


4200 


gaacgggttg 


gcatggattg 


taggcgccgc 


cctatacctt 


gtctgcctcc 


ccgcgttgcg 


4260 


tcqcggtgca 


tggagccggg 


ccacctcgac 


ctgaatggaa 


gccggcggca 


cctcgctaac 


4320 


gyattcacca 


ctccaagaat 


tggagccaat 


caattctcgc 


ggagaactgt 


gaatgcgcaa 


4380 


accaaccctt 


gccagaacat 


atccatcgcg 


tccgccatct 


ccagcagccg 


cacgcggcgc 


4440 


3tctcgggca 


gcgttgggtc 


ctggccacgg 


gtgcgcatga 


tcgtgctcct 


gtcgttgagg;. 4500 


acccggctag 


gctggcgggg 


ttgccttact 


ggttagcaga 


atgaatcacc 


gatacgcgag 


4560 


cgaacgtgaa 


gcgactgctg 


ctgcaaaacg 


tctgcgacct 


gagcaacaac 


atgaatggtc 


4620 


ttcggtttcc 


gtgtttcgta 


aagtctggaa 


acgcggaagt 


cagcgccctg 


caccattatg 


4680 


ttccggatct 


gcatcgcagg 


atgctgctgg 


ctaccctgtg 


gaacacctac 


atctgtatta 


4740 


acgaagcgct 


ggcattgacc 


ctgagtgatt 


tttctctggt 


cccgccgcat 


ccataccgcc 


4800 


agttgtttac 


cctcacaacg 


ttccagtaac 


cgggcatgtt 


catcatcagt 


aacccgtatc 


4860 


gtgagcatcc 


tctctcgttt 


catcggtatc 


attaccccca 


tgaacagaaa 


ttccccctta 


4920 


cacggaggca 


tcaagtgacc 


aaacaggaaa 


aaaccgccct 


taacatggcc 


cgctttatca 


4980 


gaagccagac 


attaacgctt 


ctggagaaac 


tcaacgagct 


ggacgcggat 


gaacaggcag 


5040 


acatctgtga 


atcgcttcac 


gaccacgctg 


atgagcttta 


ccgcagctgc 


ctcgcgcgtt 


5100 


tcggtgatga 


cggtgaaaac 


ctctgacaca 


tgcagctccc 


ggagacggtc 


acagcttgtc* 


5160 


tgtaagcgga 


tgccgggagc 


agacaagccc 


gtcagggcgc 


gtcagcgggt 


gttggcgggt 


5220 


gtcggggcgc 


agccatgacc 


cagtcacgta 


gcgatagcgg 


agtgtatact 


ggcttaacta 


5280 
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tgcggcatca gagcagattg tactgagagt gcaceatatg cggtgtgaaa taccgcacag 5340 
atgcgtaagg agaaaatacc gcatcaggcg ctctt^cgct tcctcgctca ctgact>cgct 5400 
gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 5460 
atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 5520 
caggaaccgt aaaaaggccg -cgttgctggc gtttttccat aggctccgcc cccctgacga 5580 
gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac ^ataaagata 5640 
ccaggcgttt ccccctggaa gctccctcgt gcgct-ctcct gttccgaccc tgccgcttac 5700 
cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 5760 
taggtatctc agctcggtgt aggtcgttcg ctccaagctg ggctgtgt^c acgaaccccc 5820 
cgttcagccc gaccgctgcg ccttatccgg taactat<:gt cttgagtcca acccggtaag 5880 
acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggta-tgt 5940 
aggcggtgct acagagttct tgaagtggtg gcctaactac ggctaca-cta gaagqacagt €000 
atttqgtatc tgcgctctgc tgaagccagt -taccttcgga aaaagagttg qtagctcttg 6060 
atccggcaaa caaaccaccg ct^gtagcgg tggttttttt gtttgcaagc agcagattac 6120 
gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt *<:tacggggt ctgacgctca 6180 
gtggaacgaa aactcacgtt aagggat-ttt ggtcatgaga 4:«:atcaaaaa ggatcctcac. 6240 
ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 6300 
ttggtctgac agttaccaat gcttaatcag tgaggcacct atotcagcga t-ctqtctatt 6360 
tcgttcatcc atagttgcct gactccccqt cgtgtagata actacgatac gggagggctt 64 20 
accatctggc cccagtgctg caatgatacc gogagaccca cgctcaccgg ct<x:agattt 64*80 
atcagcaata aaccagccag ccggaagggc cgagcgcaga a^tggtcctg <:aactttatc 6540 
cgcctccatc cagtctatta attgttgccg ggaagctaga giaagtagtt cgccaqttaa 6600 
tagtttgcgc aacgttgttg ccattgctgc aggcafccgtg gtgtcacgct cgtcgtttgg 6660 
tatggcttca ttcagctccg gttcccaacg at<:aaggcga gttacatgat cccccatgtt 6720 
gtgcaaaaaa gcggttagct ccttcggtcc tccgat-cgtt gt<:agaagta agttggccgc 6780 
agtgttatca ctcatggtta tggcagcact gcataattct -cttaotgtca tgccatccgt 6840 
aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttct-gagaat agtgtatqcg 6900 
gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac 6960 
tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactct>caa ggatcttacc 7020 
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gctgttgaga tccagttcga tgtaacccac 

tactttcacc agcgtttctg ggtgagcaaa 

aataagggcg acacggaaat gttgaatact 

catttatcag ggttattgtc tcatgagcgg 

acaaataggg gttccgcgca catttccccg 

tattatcatg acattaacct ataaaaatag 
a 
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tcgtgcaccc aactgatctt cagcatcttt 7080 
aacaggaagg cadaatgccg caaaaaaggg 7140 
catactcttc ctttttcaat attattgaag 7200 
atacatattt gaatgtattt agaaaaataa 7260 
aaaagtgcca cctgacgtct aagaaaccat 7320' 
gcgtatcacg aggccctttc gtcttcaaga 7380 

7381 
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an extent that no meaningful International Search can be carried out, specifically: 



3-D 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4{a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



see additional sheet 



□ As all required additional search fees were timely paid by the applicant, this International Search-Report-covers all 
searchable claims. 
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L * J res tricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



, this International Search -Report rs 



1-6,12,13,24,26-28,33-35,48,51,54-64,71 ,76,78-86,92,93,95,9Z1430,l<J2,l<33 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-6,1243,24,26-28,33-35,48,51,54-64,71,76,78-86, 
92,93,95,97-100,102,103 (all partially) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., in a manner 
independent of precursor feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses a 
ketopantoate hydroxymethyl transferase-encoding gene, e.g. 
the panB gene, e.g., from Bacillus, under conditions such 
that said panto- compound is produced, and possibly further 
recovering the compound. A recombinant microorganism (e g 
Gram-positive (e.g., belonging to the genus Bacillus (e!g" > 
Bacillus subtilis), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) which overexpresses a'sacillus 
(subtilis) ketopantoate hydroxymethyl transferase-encoding 
gene. A recombinant vector encoding a Bacillus (subtilis) 
ketopantoate hydroxymethyl transferase-encoding gene operably 
linked to regulatory sequences, e.g., comprising a nucleic 
acid sequence according to SEQ ID NO: 23 or part of SEQ 10 
N0:59. An isolated nucleic acid molecule encoding a Bacillus 
(subtilis) ketopantoate hydroxymethyl transferase, and said 
isolated ketopantoate hydroxymethyl transferase polypeptide 



2. Claims: 1-6,12,13,24,26-28,33-35,48,51,54-64,71,76,78-86, 
92,93,95,97-100,102,103 (all partially) 



A method of producing panthotenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., in a manner 
independent of precursor feed, comprising culturing a 
microorganism («.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses a 
pantothenate synthetase-encoding gene, e.g. the panC gene 
e.g., from Bacillus, under conditions such that said 
panto-compound is produced, and possibly further recovering 
the compound. A recombinant microorganism (e.g., 3 
Gram-positive (e.g., belonging to the genus Bacillus (e.g., 
Bacillus subtilis), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) which overexpresses a Bacillus 
(subtilis) pantothenate synthetase-encoding gene. A 
recombinant vector encoding a Bacillus (subtilis) 
pantothenate synthetase-encoding gene operably linked to 
regulatory sequences, e.g., comprising a nucleic acid 
sequence according to SEQ ID NO: 25 or part of SEQ ID N0:59 
An isolated nucleic acid molecule encoding a Bacillus 
(subtilis) pantothenate synthetase, and said isolated 
pantothenate synthetase polypeptide. 
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3. Claims: l-6 f 12-14 t 24 t 26-28 f 33-35,48 f 49;51 f 54-64 f «« f 71 f 76 f 

78-86,92,93,95,97-100,102,103 <al 1 partially); 15, 
17,19,23,32,106,107 (both completely) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g. , independent of 
aspartate or beta-alanine feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, lac tococci 
or Streptomyces) or dram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, e.g*, from • 
Bacillus, e.g., the aspartate-alpha-decarboxylase-encoding 
panD gene from Bacillus subtil is, under conditions such that 
. said panto-compound is produced, and possibly further 
recovering the compound. A recombinant microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus {e.g., 
Bacillus subtil is), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) which overexpresses a Bacillus 
(subtilis) aspartate-;^ pha-decarboxylase-encoding gene. A 
recombinant vector encoding a Bacillus {subtil is) 
aspartate-alpha-decarboxylase-encoding gene operably linked 
to regulatory sequences, e.g., comprising a nucleic acid 
sequence according to SEQ ID NO: 27 or part of S£<| IX) N0:59. 
An isolated nucleic acid molecule -encoding a Bacillus 
(subtil is) aspartate-alpha-decarboxylase, and said isolated 
aspartate-alpha-decarboxylase polypeptide. 

4. Claims: 1-6,24,26-28,33-35,48,49,51,54-64,71,76,73-87,92, 

93,95,97-100,102,103 (all partially); 7-11,65,101, 
104,105 (all completely) , 



A method of producing pantothenate -(e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., independent of 
precursor feed, comprising culturing a microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus, 
Corynebacterium, Lactobacillus, Lactococci or Streptomyces) 
or Gram-negative) which overexpresses a ketopantoate 
reductase-encoding gene, e.g., from Bacillus, e.g., the 
ketopantoate reductase-encoding panEl gene from Bacillus 
subtil is, under conditions such that said panto-compound is 
produced, and possibly further recovering the compound. A 
recombinant microorganism <e.g., Gram-positive (e.g., 
belonging to the genus Bacillus (e.g., Bacillus subtil is), 
Corynebacterium, Lactobacillus, Lactococci or Streptomyces)) 
which overexpresses a Bacillus (subtil is) ketopantoate 
reductase-encoding gene. A recombinant vector encoding a 
Baci 1 lus (subti lis) ketopantoate reductase-encodi ng gene 
operably linked to regulatory sequences, e.g., comprising a 
nucleic acid sequence according to StQ 10 N0:29. An isolated 
nucleic acid molecule encoding a Sacillus (subtilis) 
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ketopantoate reductase, and said isolated ketopantoate 
reductase polypeptide. 



5. Claims: 14,16,18,28,48,54-61,66,77-82, 

97 (all partially); 20,29 (completely) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of valine or 
alpha-ketoisovalerate feed, comprising culturing a 
microorganism (e.g., Gram-positi ve (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram- negative) having a deregulated 
isoleucine-valine (ilv) pathway, wherein the microorganism 
overexpresses acetohydroxyacid synthase or is transformed 
with a vector comprising an ilvBN nucleic acid, sequence or 
an alsS sequence, e.g., from Bacillus, under conditions such 
that pantothenate is produced, and possibly further 
recovering the pantothenate. Said microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus (e.g., 
Bacillus subtilis), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)), and said vector. , 



6. Claims: 14,16,18,28,48,54-61,66,77-82, 

97 (all partially); 21,30 (completely) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of valine or 
alpha-ketoisovalerate feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) having a deregulated 
isoleucine-valine (ilv) pathway, wherein the microorganism 
overexpresses acetohydroxyacid i someroreductase or is 
transformed with a vector comprising an ilvC nucleic acid 
sequence, e.g., from Bacillus, under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. Said microorganism (e.g., Gram-positive 
(e.g., belonging to the genus Bacillus (e.g., Bacillus 
subtilis), Corynebacterium, Lactobacillus, Lactococci or 
Streptomyces)), and said vector. 



7. Claims: 14,16,18,28,48,54-61,66,77-82,97 (partially); 22, 
31 (completely) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of valine or 
alpha-ketoisovalerate feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
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. or Streptomyces) or Gram-negative) having a deregulated 
i sol eucine- valine <i 1 v) pathway, wherein the microorganism 
overexpresses dihydroxyacid dehydratase or is transformed 
with a vector comprising an ilvO nucleic acid sequence, 
e.g., from Bacillus, under conditions such that pantothenate 
is produced, and possibly further recovering the 
pantothenate. Said microorganism (e.g., Gram-positive <e.g., 
belonging to the genus Bacillus {e.g., Bacillus -subtil is) , 
Corynebacterium, Lactobacillus, Lactococci or 
Streptomyces)), and said vector. 

8. Claims: 25,28,50,54-51 <all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of precursor feed, 
comprising culturing a microorganism (e.g., Gram-positive 
(e.g., belonging to the genus Bacillus, Corynebacterium, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negative) 
having a mutant avtA gene under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. 

9. Claims: -25,28,50,54-51 <all partially) 

A method of producing pantothenate ^e.g., 2 g/L up -to 40 g/L 
at least), e.g., in a manner independent of precursor feed, 
comprising culturing a microorganism {e.g., 'Gram-positive 
(e.g., belonging to the genus Bacillus, Corynebacterium, 
Lactobacillus, Lactococci or Streptomyces) or -Gram-negative) 
having a mutant ilvE gene under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. 

10. Claims: 25,28,50,54-61 {all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of precursor feed, 
comprising culturing a microorganism {e.g., -Gram-positive 
(e.g., belonging to the genus Bacillus, Corynebacterium, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negative) 
having a mutant ansB gene under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. 

11. Claims: 25,28,'50,S4-61 (all partially) 

A method of producing pantothenate {e.-g. , 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of precursor feed, 
comprising culturing a microorganism {e.g., Gram-positive 
(e.g., belonging to the genus Bacillus, Corynebacterium, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negative) 
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having a mutant alsO gene under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. 



12. Claims: 36,37,54-59,61 (all partially) 

A method of producing beta-alanine, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressing microorganism has 
a mutation in a nucleic acid sequence encoding ketopantoate 
hydroxymethyl transferase (panB), and possibly further 
recovering the compound. 



13. Claims: 36,37,54-59,61 (all partially) 

A method of producing beta-alanine, comprising culturing a 
microorganism (e.g. , Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram- negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressing microorganism has 
a mutation in a nucleic acid sequence encoding ketopantoate 
reductase (panE), and possibly further recovering the 
compound. 



14. Claims: 36,37,54-59,61 (all partially) 

A method of producing beta-alanine, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram- negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressing microorganism has 
a mutation in a nucleic acid sequence encoding pantothenate 
synthetase (panC), and possibly further recovering the 
compound. 



15. Claim : 38 (completely) 

A method of producing beta-alanine comprising contacting a 
composition comprising aspartate with an isolated Bacillus 
aspartate-alpha-decarboxylase enzyme under conditions such 
that beta-alanine is produced. 



I 
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16. Claims: 41,44-47,51,53,54-81,89,71,72,75,78-81, 

97 {all partially); 39,43,52,67,70,74,88-91, 
168-110 {all completely) 



A method for producing or -f or enhancing production of 
ketopantoate, pantoate, or pantothenate (e.g., 1*0, 20 or 40 
g/L at least), comprising cuUuring a mutant microorganism 
(e.g., Gram-positive {e.g., belonging to the genus 8acillus, 
Corynebacterium, Lactobacillus, Lactococci or Streptomyces) 
or -Gram-negative) having a mutant pantothenate 
kinase-encoding coaX gene, under conditions such that said 
panto-compound is produced or that production is enhanced, 
and possibly further recovering the compound. A method for 
identifying compounds which modulate pantothenate kinase 
activity comprising contacting a recombinant cell expressiftg 
the coaX gene, possibly further comprising a militant coaA 
gene encoding a pantothenate kinase with reduced activity, 
with a test compound and determining the ability of the test 
compound to modulate pantothenate kinase activity in said 
cell. A recombinant microorganism having a mutant coaX ^ene 
encoding a pantothenate kinase with reduced activity. A 
vector comprising a mutant coaX gene encoding a pantothenate 
kinase with reduced activity, possibly further comprising 
regulatory sequences. A recombinant microorganism comprising 
a vector comprising an isolated coaX gene (e.g., from 
Bacillus (subtil is)), and said vector, possibly further 
comprising regulatory sequervces, e.g., a <onstitutive1y 
active promoter. A recombinant microorganism {e.g., 
Gram-positive (e.g., belonging to the genus 8acillus {e.-g., 
Bacillus subtil is), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) that overproduces a 
panto-compound having a mutation in a coaX gene that results 
in a reduced level of pantothenate kinase activity, 
resulting in a decrease in the capacity of the microorganism 
to synthesize coenzyme A. An isolated nucleic acid molecule 
comprising a (mutant) coaX <jene, and an isolated 
pantothenate kinase protein -encoded by a coaX gene. 



17. Claims: 41,44-47,51,53-81,69,71,72,75,78-81, 

97 (all partially); 40,42,68,73 {all completely) 



A method for producing or for enhancing production of a 
panto-compound, e.g., ketopantoate, pantoate or pantothenate 
{e.g., 10, 20 or 40 g/L at least) , -comprising -culturing a 
mutant microorganism Ke .g. , Gram-positive {e.g., belongitig 
to the genus Bacillus, Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces) or Gram- negative) having a 
mutant pantothenate kinase -encoding coaA gene, under 
conditions such that the panto-compound is produced or that 
production is enhanced, and possibly further recovering the 
panto-compound. A recombinant microorganism having a mutant 
coaA gene encoding a pantothenate kinase with reduced 
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activity. A recombinant microorganism (e.g., Gram-positive 
(e.g., belonging to the genus Bacillus (e.g., Bacillus 
subtil is), Corynebacterium, Lactobacillus, Lactococci or 
Streptomyces)) that overproduces a panto-compound having a 
mutation in a coaA gene that results in a reduced level of 
pantothenate kinase activity, resulting in a decrease in the 
capacity of the microorganism to synthesize coenzyme A. A 
vector containing a (mutated) coaA gene. 



18. Claim : 94 (partially) 

A vector containing regulatory sequences comprising the 
constitutively active promoter Pveg (SEQ ID NO:41). 



19. Claim : 94 (partially) 

A vector containing regulatory sequences comprising the 
constitutively active promoter P15 {SEQ ID N0:39). 



20. Claim : 94 (partially) , 

A vector containing regulatory sequences comprising the 
-constitutive^ active promoter P26 (SEQ ID N0:40). 



21. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID NO: 49. 



22. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:50. 



23. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:51. 



24. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:52. 



25. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
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artificial RBS according to SEO. 10 N0:S3. 

26. Claim : 96 (partially) 

A vector containing regulatory sequence* comprising an 
artificial RBS according to "SEQ 10 N0-.S4. 

27. Claim : 96 (partially) 

A vector containing regulatory -sequences comprising an 
artificial RBS according to 'SEQ 10 N0:S5. 

28. Claim : 96 (partially) 

A vector containing regulatory sequences compri-sing an 
artificial RBS according to "SEQ 10 W0:56. 

i 

29. Claim : 96 (partially) 4 

A vector containing regulatory sequences comprising an 
artificial RBS according to S£Q ID N0:57. 
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METHODS AND MICROORGANISMS FOR 
PRODUCTION OF PANTO-COMPOUNDS 

Background of the Invention 

5 Pantothenate, also known as pantothenic acid or vitamin B5, is a member of the 

B complex of vitamins and is a nutritional requirement for mammals, including 
livestock and humans (e.g., from food sources, as a water soluble vitamin supplement or 
as a feed additive). In cells, pantothenate is used primarily for the biosynthesis of 
coenzyme A (CoA) and acyl carrier protein (ACP). These coenzymes function in the 

10 metabolism of acyl moieties which form thioesters with the sulfhydryl group of the 4'- 
phosphopantetheine portion of these molecules. These coenzymes are essential in all 
cells, participating in over 100 different intermediary reactions in cellular metabolism. 

The conventional means of synthesizing pantothenate ^in particular, the bioactive 
D isomer) is via chemical synthesis from bulk chemicals, a process which is hampered 

15 by excessive substrate cost as well as the requirement for optical resolution of racemic 
intermediates (e.g., resolution of DL-pantolactone to obtain D-pantolactone for chemical 
condensation with 0-alanine). Accordingly, researchers have recently looked to 
bacterial or microbial systems that produce enzymes useful in pantothenate biosynthesis 
processes (as bacteria are themselves capable of synthesizing pantothenate). In 

20 particular, bioconversion processes have been evaluated as a means of favoring 
production of the D isomer of pantothenic acid, e.g., using microorganisms which 
selectively hydrolyze a DL-pantothenic acid ester to D-pantothenic acid; 
microorganisms which selectively decompose L-pantolactone resulting in D- 
pantolactone alone; and microorganisms which selectively hydrolyze DL-pantolactone 

25 to D-pantoic acid. 

There is still, however, significant need for improved pantothenate production 
processes, in particular, for processes requiring reduced quantities of substrates and/or 
less expensive substrates. To this end, methods of direct microbial synthesis have 
recently been examined as a means of improving D-pantothenate production. In 

30 microbes, pantothenate biosynthetis is a multistep pathway resulting in condensation of 
pantbate (derived from a-ketoisovalerate) and p-alanine to form D-pantothenate. The 
isoleucine-valine (ilv) pathway biosynthetic enzymes, acetohydroxyacid synthetase (the 
ilvBN or alsS gene product), acetohydroxyacid isomeroreductase (the ilvC gene product) 
and dihydroxyacid dehydratase (the ilvD gene product) catalyze the conversion of 

35 pyruvate to a-ketoisovalerate. The reactions are further catalyzed by the pantothenate 
(pan) pathway biosynthetic enzymes ketopantoate hydroxymethyltransferase {ihepanB 
gene product), ketopantoate reductase (the panE gene product), aspartate-ct- 
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decarboxylase<the pan D gem product) and pantothenate synthetase -(thepa/jC gene 
product). 

The genes encoding the enzymes involved in die biosynthesis of pantothenic acid 
in Salmonella typhimurium and Escherichia coli have recently been identified and 
5 characterized (Frodyma and Downs(1998) J. Biol. Client. 273:5572-5576 and 

Jackowski (1996) pp. 687-694, In Neidhardt-er a/<ed.) Escherichia coli wd Salmonella: 
Cellular and Molecular Biology, 2 nd ed. Am. Soc. Microbiol. Wash, D.C). In E. -coli, for 
example, the biosynthesis of pantothenic acid consists of four key steps. The first 
reaction is catalyzed by the panB gene product, ketopantoate hydroxymemyltransfiarase, 
10 and uses the L-valine intermediate a-ketoisovalerate to generate ketopantoate, which is 
subsequently reduced to pantoate by the panE gpne product, ketopantoate reductase. 
The panD gene product, aspartate-a-decarboxylase, generates ^alanine from aspartate. 
The panC gene product, pantothenate synthetase, subsequently ligates P-alanine with 
pantoate to yield D-pantothenate. 
15 The authors Dusch et al. described the identification of the Corynebacterium 

glutamicvm panD gene and reported that expression of the C. glutamicum panD &ne in 
E. coli yielded a strain producing pantothenate with a specific productivity of 140 ng of 
pantothenate per mg (dry weight) per hour. <Dusch et al. (1999) Appl. Environ, 
Microbiol. 65:1530-1539). 
20 The authors Sahm and Eggeling have fiirther identified the Corynebacterium 

glutamicum panB and pan C genes and have described a genetically -engineered strain of 
C. glutamicum which overexpresses the panBC genes (Sahm and Eggeling (1999) Appl. 
Environ. Microbiol. 65:1973-1979). The engineered strain produces pantothenate, 
however, it was necessary to overexpress the genes responsible for a-ketoisovalerate 
25 production in the host organism in order that pantothenic acid production could be 
detected. Moreover, without the addition of ^-alanine, no substantial amounts of 
pantothenate accumulated with the strain constructed. 

Likewise, a method of producing D-pantothenic acid has been described that 
takes advantage of a sodium salicylate resistant mutant strain of £ -coli which produces 
30 D-pantothenic acid when cultured in the presence of ^-alanine (U .S. Patent No. 
5,5 1 8,906). Generation of E. coli strains resistant to a-ketoisovaleric acid and/or a- 
ketobutyric acid, and/or a-aminobutyric acid, and/or p-hydroxyaspartic acid and/or O- 
memyl-threonine, in addition to salicylic acid, further increased pantothenic acid 
production. Moreover, transformation of a plasmid DNA carrying the panB, panC and 
35 panD genes into the salicylic acid resistant mutant strain resulted in increased 
pantothenate production, however, up to 20 g/L ^-alanine or more was fed in the 
examples given. The panB-panC-panD genes are clustered on the E. coli chromosome. 
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Finally, a method of producing D-pantothenic acid has been described which 
utilizes a salicylic acid-resistant, a-ketoisovalerate-resistant, a-ketobutyrate-resistant, 0- 
hydroxyaspartate-resistant, o-methylthreonine-resistent£. coli strain transformed with 
pantothenate biosynthesis gene-containing DNA fragments and/or branched amino acid 
5 biosynthesis gene-containing DNA fragments and cultured in the presence of p-alanine ' 
(U.S. Patent No. 5,932,457). 

Pantothenate production in bacteria results from the condensation of pantoate 
and p-alanine and involves the pantothenate biosynthetic enzymes ketopantoate 
hydroxymethyltransferase (IhepanB gene product), ketopantoate reductase (foepanE 
10 gene product), aspartate-a-decarboxylase (the panD gene product) and pantothenate 
synthetase (ihepanC gene product). Although pantothenate is biologically active as a 
vitamin, it is further metabolized in all cells to Coenzyme A (CoA) which participates as 
an acyl group carrier in the tricarboxylic acid (TCA) cycle, fatty acid metabolism and 
numerous other reactions of intermediary metabolism. The jnitial (and possibly rate- 

15 , controlling) step in the conversion of pantothenate to Coenzyme A (CoA) is 

phosphorylation of pantothenate by pantothenate kinase. A pantothenate kinase activity 
was first identified in Salmonella typhimwium by screening for temperature-sensitive 
mutants which synthesized CoA at permissive temperatures but excreted pantothenate at 
non-permissive temperatures. The mutations were mapped in the Salmonella 

20 chromosome and the genetic locus was designated coaA. The gene encodes the enzyme 
that catalyzes the first step in the biosynthesis of coenzyme A from pantothenate (Dunn 
and Snell (1979) J. Bacteriol. 140:805-808). Escherichia coli temperature sensitive 
mutants have also been isolated and characterized (Vallari and Rock (1987) J. Bacteriol. 
169:5795-5800). These mutants (named coaA15(Ts)) are defective in the conversion of 

25 pantothenate to CoA and further exhibit a temperature-sensitive growth phenotype, 
indicating that pantothenate kinase activity is essential for growth. Moreover, it was 
noted that CoA inhibited pantothenate kinase activity to the same degree in the mutant 
as compared to the wild-type «nzyme. 

Feedback resistant R coli mutants (named coaA16(Fr)) have also been isolated 

30 that posses a pantothenate kinase activity that is refractory to feedback inhibition by 
CoA (Vallari and Jackowski (1988) J. Bacteriol. 170:3961-3966). The mutation 
responsible for the reversion is, suprisingly, not genetically linked to the coaA gene by 
transduction. Additional data described therein support the view that the total cellular 
CoA content is controlled by both modulation of biosynthesis at the pantothenate kinase 

35 step and possibly by degradation of CoA to 4'-phosphopantetheine. 
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The wild-type E. coli coaA gene was cloned by functional complementation of 
E. coli temperature-sensitive mutants. The sequence of the wild-type gene was 
determined {Song and Jackowski (1992)/. Bacterid. 174:6411-6417 and Fiamm*/ al. 
(1 988) Gene (Amst.) 74:555-558). Strains containing multiple copies of the coaA gene 

5 possessed 76-fold higher specific activity of pantothenate kinase, however, there was 
only a 2.7-fold increase in the steady state level of CoA (Song and Jackowski, supra). It 
has further been reported that the prokaryotic enzyme {encoded by coaA in Ecoli and a 
variety of other microorganisms) is feedback inhibited by CoA both in vivo and in vitro 
with CoA being about five times more potent than acetyl-CoA in inhibiting the enzyme 

10 (Song and Jackowski, supra and Vallari et al, supra). Moreover, it has been reported 
that the panB gene product in E. coli is inhibited by CoA^Powers and Snell (1976) J. 
Biol Chem. 25 1 :3786-3793). These data further-support the vieW that feedback 
inhibition of pantothenate kinase activity is a critical factor controlling intracellular CoA 
concentration. 

15 Using standard search and alignment tools, coaA homologues have been 

identified in Hemophilus influenzae, Mycobacterium tuberculosis, Vibrio cholerae, 
Streptococcus pyogenes and Bacillus subtilis. By-contrast, proteins with significant 
similarity could not be identified in-eukaryotic cells including Saccharomyces cerevisiae 
or in mammalian expressed sequence tag<EST) databases. Using a^genetic selection 

20 strategy, a cDNA encoding pantothenate kinase activity has recently been identified 
from Aspergillus nidulans{CM^ etal. {1999) J Biol Chem. 274:2014-2020). The 
eukaryotic pantothenate kinase gene (panK) has distinct primary -structure and unique 
regulatory properties that clearly distinguish it from its prokaryotic counterpart. A 
mammalian pantothenate kinase gene {mpanKla) has also been isolated which encodes 

25 a protein having homology to the A. nidulahs PanK protein and to the predicted gene 
product of GenBank™ Accession Number 927798 identified in the S. cerevisiae 
genome <Rocke/ al. (2000) J. Biol Chem. 275:1377-1383). 

Summary of the Invention 

30 The present invention is based, at least in part, on the discovery of key enzyme- 

encoding genes of the pantothenate biosynthetic pathway in Bacillus subtilis. In 
particular, the present inventors have identified the paraEgene of B. subtilis. 
Overexpression or deregulation of the panE gene in B. subtUis results in enhanced 
production of the panE gene product, ketopantoate reductase, further resulting in 

3 5 increased production of pantothenate. Likewise, mutations in this gene reduce 
pantothenate production in B. subtilis >90%. The present inventors have further 
identified the presumptive panBCD operon in B. subtilis, overexpression or 
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deregulation of which results in increased pantothenate production. The present 
inventors have further demonstrated that overexpression or deregulation of the panD 
gene in B. subtilis (resulting in enhanced production of Hit panD gene product, 
aspartate-a-decarboxylase) further results in increased production of pantothenate, in 
5 particular, in combination with deregulation of genes encoding key en2ymes of the 
isoleucine-valine (i/v) biosynthetic pathway. 

Accordingly, the present invention features methods of producing pantothenate, 
as well as other compounds of the pantothenate biosynthetic pathway (e.g, 
ketopantoate, pantoate and P-alanine), termed "panto-compounds" herein, using 

1 0 microorganisms in which the pantothenate biosynthetic pathway and/or isoleucine- 
valine biosynthetic pathway has been manipulated such that panto theriate or other 
desired panto-compounds are produced. In one embodiment, the invention features a 
method of producing a panto-compound (e.g. , pantothenate or pantoate) that involves 
culturing a microorganism which overexpresses the panE gene product, ketopantoate 

1 5 reductase, also referred to herein as a ketopantoate reductase-overexpressing or 
"KPAR-O" microorganism, under conditions such that the panto-compound (e.g., 
pantothenate or pantoate) is produced. In another embodiment, the present invention 
features a method of producing panto-compounds (e.g., pantothenate or pantoate) 
which includes culturing a microorganism which overexpresses at least one 

20 pantothenate biosynthetic enzyme (e.g., at least one of the panB,panC or panD gene 
products), preferably in a KPAR-O microorganism, under conditions such that the 
panto-compound (e.g. , pantothenate or pantoate) is produced. 

Yet another aspect of the invention features methods of producing panto- 
compounds which are independent of the need to feed precursors (e.g., P-alanine or 

25 aspartate and/or a-ketoisovalerate or valine). In one embodiment, the invention 

features a method of producing pantothenate in a manner independent of precursor feed 
that includes culturing an aspartate-a-decarboxylase-overexpressing (AaD-O) 
microorganism having a deregulated isoleucine-valine (ilv) pathway under conditions 
such that pantothenate is produced. In another embodiment, the invention features a 

30 method of producing pantothenate in a manner independent of precursor feed that 
includes culturing an AaD-0 microorganism having a deregulated pantothenate (pari) 
pathway and a deregulated isoleucine-valine (z7v) pathway, under conditions such that 
pantothenate is produced. In another embodiment, the invention features a method of 
producing pantothenate in a manner independent of aspartate or P-alanine feed that 

35 includes culturing an AaD-0 microorganism under conditions such that pantothenate is 
produced. In another embodiment, the invention features a method of producing 
pantothenate in a manner independent of valine or a-ketoisovalerate feed that includes 
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• 

culturing a microorganism having a deregulated isoleucine-valine (ttv) biosynthetic 
pathway under conditions such that pantothenate is produced. In yet another 
embodiment, the invention features a high yield production method for producing 
pantothenate that includes culturing a manipulated microorganism under conditions 
5 such that pantothenate is produced at a significantly high yield<e.g. , at a level -greater 
than 10 g/L, 20g/L, 30 g/L or 40g/L). 

The methods of the present invention further feature microorganisms *hat 
overexpresses acetohydroxyacid synthetase or acetohydroxyacid isomeroieductase<eg., 
microorganisms transformed with a vector that includes an UvBNC nucleic acid 
1 0 sequence), microorganisms that overexpresses dihydroxyacid dehydratase (e.g. , 

microorganisms transformed with a vector that includes an ilvD nucleic acid sequence), 
microorganisms that overexpresses aspartate-a-decarboxylase (e.g., microorganisms 
transformed with a vector that includes a panD nucleic acid sequence), microorganisms 
having a deregulated isoleucine-valine (i/v) biosynthetic pathway and microorganisms 
1 5 having a deregulated pantothenate biosynthetic pathway (e.g. , microorganisms that 
overexpress any of ketopantoate hydroxymethyltrans&rase, ketopantoate reductase, 
pantothenate synthetase and aspartate-a-decarboxylase, for example, microorganisms 
transformed with a vector comprising apanBCD nucleic acid sequence or a vector 
comprising a panEl nucleic acid sequence). In one embodiment, the recombinant 
20 microorganism is Gram positive (e.g. , microorganisms belonging to the genus Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci or Streptomyces). In another embodiment, 
the recombinant microorganism is Gram negative. Particularly preferred is a Bacillus 
recombinant microorganism (e.g., Bacillus licheniformis, Bacillus amyloliquefaciens 9 
Bacillus subtiliSy Bacillus pumilus, Bacillus halodurans, and the like). Recombinant 
25 vectors that contain the genes encoding Bacillus pantothenate and/or isoleucine-valine 
biosynthetic enzymes (e.g., B. subtilis pantothenate and/or isoleucine-valine biosynthetic 
enzymes) are also described. 

Also featured are methods of producing ^-alanine that include culturing an 
aspartate-a-decarboxy lase-overexpressing { AaD-O) microorganism under conditions 
30 such that p-alanine is produced and methods of producing p-alanine that involve 
contacting a composition comprising aspartate with an isolated Bacillus aspartate-a- 
decarboxylase enzyme under conditions such that P-alanine is produced. 

The production methods of the present invention further can include recovering 
the panto-compound (e.g., pantothenate or pantoate). 
35 The present invention further features recombinant microorganisms (e.g. , AaD- 

O microorganisms, microorganisms having a deregulated isoleucine-valine^z/v) 
pathway, microorganisms overexpressing at ieast one of ketopantoate 
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hydroxymethyltransferase^the panB gene product), pantothenate synthetase (the panC 
gene product), aspartate-a-decarboxylase (the panD gene product), ketopantoate 
reductase (ihepanEl gene product) and microorganisms having a deregulated panBCD 
, operon. Also featured are panB, panC, panD, panE, ilvB, ilvN 9 alsS, ilvC, and/or ilvD 
5 nucleic acid molecules, as well as vectors including such nucleic acid molecules and 
gene products encoded by such nucleic acid molecules. , 

The methodology of the present invention further includes, for example in 
addition to overexpressing at least one pantothenate biosynthetic enzyme, deleting or 
mutating a second pantothenate biosynthetic enzyme, said second pantothenate 
1 0 biosynthetic enzyme preferably being downstream of the desired product in the 
pantothenate biosynthetic pathway. For example, mutating panC, in addition to 
overexpressing the panE gene product, results in even further enhanced or increased 
production of pantoate. Accordingly, in one embodiment, the invention features a 
method of producing pantoate which includes culturing a microorganism which 
15 ' overexpresses the par/iE gene product and which has a deletion in the panC gene. In 
another embodiment, the invention features a method of producing pantoate which 
includes culturing a microorganism which overexpresses the panE gene product and/or 
panB gene product and which has a deletion in the panC gene: Other exemplary 
embodiments include a method of producing ketopantoate which includes culturing a 
20 microorganism which overexpresses the panB gene product and which has a deletion in 
the panE gene and a method of producing ^-alanine which includes culturing a 
microorganism which overexpresses the panD gene product and which has a deletion in 
ihepanC gene. Also included are methods of producing panto-compounds which 
include overexpressing at least one valine biosynthetic enzyme in a microorganism 
25 which has at least one pantothenate biosynthetic enzyme deleted. 

The present invention is also based at least in part, on the identification and 
characterization of a previously unidentified microbial pantothenate kinase gene, coaX. 
CoaXvtas first identified in Bacillus subtilis and corresponds to an open reading frame 
in a portion of the chromosomal DNA that includes the 5' end of the ftsHgrnc, and all 
30 of the yacB, yacC, yacD, cysK and pabB genes. The present inventors have 

demonstrated that the yacB open reading frame encodes a novel pantothenate kinase 
activity, the gene being unrelated by homology to any previously known pantothenate 
kinase gene. The gene has been renamed coaX, as it encodes the enzyme which 
catalyzes the first step in the pathway from pantothenate to CoaA. 
35 Accordingly, the present invention features new and improved methods of 

producing pantothenate and other key compounds of the pantothenate biosynthetic 
pathway (e.g., panto-compounds) utilizing microorganisms having modified 
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pantothenate kinase activity. In particular, the present invention features recombinant 
microorganisms that contain the coo* gene or that contain a mutant-cp^-gene, having 
reduced pantothenate -kinase activity. In one embodiment, the invention features such 
recombinant microorganisms further having a deregulated pantothenate biosynthetic' 
5 pathway. In another embodiment, the invention features such recombinant 

microorganisms further having a deregulated isoteucine-valine <i/v) pathway. In a 
preferred embodiment, the microorganisms belong to the genus Bacillus {e.g.,B. 
subtilis). 

The present invention also features recombinant microorganisms (e.g., 
10 microorganisms belonging to the-genus Bacillus, for example, B. subtilis) mat contain 
the coaA gene or that contain a mutant coa^gene, optionally including a coaX gene or 
mutant thereof, having reduced pantothenate kinase activity, in one embodiment, the 
invention features such recombinant microorganisms further having a deregulated 
pantothenate biosynthetic pathway or having a deregulated isoleucine-valine<«v) 
15 pathway. 

Also featured are vectors that contain isolated coaXor-coaA genes as well as 
mutant coaX and/or coaA genes. Isolated nucleic acid molecules that contain isolated 
coaA'genes or mutant coaATgenes are featured in addition to isolated CoaX proteins and 
mutant CoaX proteins. 

20 The nucleic acids, vectors and recombinant microorganisms described above are 

particularly useful in the methodologies of the present invention. In particular, the 
invention features methods of enhancing panto-compound production <e.g., 
ketopantoate, pamoate and or pantothenate production) that include culturing a 
recombinant microorganism having a mutant coaX gene under -conditions such that 

25 panto-compound production is^nhanced. In one embodiment, the recombinant 
microorganism further includes a mutant coaA gene. In another embodiment, the 
recombinant microorganism further includes a mutant avtA and/or mutant ilvfi^ene 
and/or mutant ansB gene and/or mutant alsD&as. Also featured are me&odsfor 
identifying pantothenate modulators utilizing the recombinant microorganisms and 

30 purified CoaX proteins of the present invention. 

Other features and advantages of the invention will be apparent feom the 
following detailed description and claims. 
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Brief Description of the Drawings 

Figure 1 is a schematic representation of the pantothenate biosynthetic pathway. 
Figure 2 is a schematic representation of the plasmid pAN240, obtaining 
sequences ligated upstream of the P 26 panBCD cassette, equivalent to the integrated 
' version in strain PA221 . 

Figure 3A is a schematic representation of the plasmid pAN004, containing the 
panBCD operon expressed from P 26 and RBS 1 . 

Figure 3B is a schematic representation of the plasmid pAN006, containing the 
panBCD operon expressed from P 2S and RBS2. 

Figure 4 is a schematic representation of the plasmid pAN236, containing an 
integratable and amplifiable P 26 -RBS2-ponEl expression cassette. 

Figure J is a schematic representation of the construction of plasmid pAN423. 
Figure 6 is a schematic representation of the construction of plasmids pAN426 
andpAN427. 

Figure 7 is a schematic representation of the construction of plasmids pAN428 
and pAN429. 

Figure 8 is a schematic representation of the construction of plasmid pAN431. 

Figure 9 is a schematic representation of the construction of plasmid pAN441 . 

Figure 10 is a schematic representation of the construction of plasmid pAN440. 
20 Figure 11 is a schematic representation of the plasmid pAN251 designed to 

integrate a single copy of a P 26 -panEl cassette at the panEl locus by double crossover. 

Figure 12 is a schematic representation of the plasmid pAN267 designed to 
integrate a single copy of a P 26 -ilvBNC cassette at the omyE locus. 

Figure 13 is a schematic representation of the plasmid pAN257, a clone of 
25 Bacillus subtilis ilvD in a low copy vector. 

Figure 14 is a schematic representation of the plasmid pAN263, designed to 
integrate a single copy of a P 26 -ilvD cassette at the ilvD locus. 

Figure 15 is a schematic representation of the plasmid pAN261, designed to 
disrupt the Bacillus subtilis ilvD gene with the cat gene. 

30 Figure 16 is a schematic representation of the Coenzyme A biosynthetic pathway 

in£ coli. 

Figure 17 is a schematic representation of the structure of pAN296, a plasmid 
designed to delete most of the B. subtilis coaA gene and substitute a chloramphenicol 
resistance gene. 

35 Figure 18 is a schematic representation of the structure of the Bacillus subtilis 

genome in the region of the coaA gene. The scale is in base pairs and the significant 
open reading frames are shown by open arrows. 
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Figure 19 is a schematic representation of the piasmid pAN2S 1 , a plasmid for 
expressing Bacillus subtilis coaA after integration at the bpr locus. 

Figure 20A-B depicts a multiple sequence alignment (MSA) of the amino acid 
sequences encoded by six known or predicted microbial coaA genes. SEQ ID NOs:^ , 
5 and 1-3 correspond to the amino acid sequences of Mycobacterium leprae (SwissProt 11 ^ 
Accession No. Q9X795), Mycobacterium tuberculosis <SwissProt™ Accession No. 
053440), Streptomyces coelicohriSmssPxot™ Accession No. T)86799), Haemophilus 
influenzae (SwissProt™ Accession No. P44793), Escherichia <^/*SwissProt™ 
Accession No. P15044) and Bacillus subtilis {SwissProt™ Accession No. P54556), , 
1 0 respectively. The alignment was generaied using ClustalW MSA software at the 
GenomeNet CLUSTALW Server at the Institute for Chemical Research, Kyoto 
University. The following parameters were used: Pairwise Alignment, K-tuple (word) 
size = 1 , Window size = 5, Gap Penalty = 3, Number of Top Diagonals = 5, Scoring 
Method = Percent; Multiple Alignment, Gap Open Penalty = 10, Gap Extension Penalty . 
1 5 = 0.0, Weight Transition = No, Hydrophilic residues = Gly, Pro, Ser, Asn, Asp, Gin, 
Glu, Arg and Lys, Hydrophobic Gaps = Yes; and Scoring Matrix = BLOSUM. 

Figure 21 is a schematic representation of the structure pf the Bacillus subtilis 
genome in the region of the coaXiyacB) gene. The scale is in base pairs, the significant 
open reading frames are shown by open arrows and -certain predicted restriction 
20 fragments are indicated by thick bars. 

Figure 22 is a schematic representation of the structure of pAN341 and pAN342, 
two independent PCR-derived clones of B. subtilis yacB {remaned herein as coaX). 

Figure 23A-D depicts a multiple sequence alignment {MS A) of the amino acid 
sequences encoded by fourteen known or predicted microbial coaYgenes. SEQ ID 
25 NOs:9, 74, 7-8, 75, 1 1, 10 and 12-18 correspond to the amino acid sequences of BacHlus 
subtilis (SwissProt™ Accession No. P37564), Clostridium acetobulyticum (WIT™ 
Accession No. RCA03301, Argonne National Laboratories), Streptomyces coelicolor 
(PIR™ Accession No. T36391), Mycobacterium tuberculosis XSwissProt™ Accession 
No. 006282), Rhodobacter capsulatusiyflT™ Accession No. RRC02473), 
30 Desulfovibrio vulgaris (DBJ™ Accession No. BAA21476.1), Deinococcus radiodurans 
(SwissProt™ Accession No. Q9RX54), Thermotoga manYzma(GenBank™ Accession 
No. AAD35964.1), Treponema pallidum 4Svnss?rot™ Accession No. 083446), 
Borrelia burgdorferi (SwissProt™ Accession No.05 1477), Aquifex aeolicus 
(SwissProt™ Accession No. 067753), Synechocystis sp. (SwissProt™ Accession No. 
3 5 P74045), Helicobacter pylori (SwissProt™ Accession No. 025533), and Bordetella 
pertussis (SwissProt™ Accession No. Q45338), respectively. The alignment was 
generated using ClustalW MSA software at the GenomeNet CLUSTALW Server at the 
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Institute for Chemical Research, Kyoto University. The following parameters were 
used: Pairwise Alignment, K-tuple (word) size = 1, Window size = 5, Gap Penalty = 3, 
Number of Top Diagonals = 5, Scoring Method = Percent; Multiple Alignment, Gap 
Open Penalty = 10, Gap Extension Penalty = 0.0, Weight Transition = No, Hydrophilic 
5 residues = Gly, Pro, Ser, Asn, Asp, Gin, Glu, Arg and Lys, Hydrophobic Gaps = Yes; 
and Scoring Matrix = BLOSUM. 

Figure 24 depicts a multiple sequence alignment of a portion of the protein 
sequences of the coaA gene products fiom the following microorganisms: Bacillus 
subtilis, Escherichia coli, Haemophilus influenzae, Mycobacterium leprae, 

1 0 Mycobacterium tuberculosis, and Streptomyces coelicolor. The residues that are 

mutated in E. coli coaA15(Ts) and B. subtilis coaA282A are indicated below and above 
the alignment, respectively. The portions correspond to amino acid residues 168-1 87 of 
SEQIDNO:3, 167-186 of SEQ ID NO:2, 165-184 of SEQ ID NO: 1, 169-188 of SEQ 
ID NO:4, 169-188 of SEQ ID NO:5 and 179-198 of SEQ ID NO:6, respectively. 

15 Figure 25 is a schematic representation of the structure of pAN294, a plasmid 

for integrating mutagenized B. subtilis coaA at its native locus. 

Figure 26 is a schematic representation of the structure of pAN336, a plasmid 
designed to delete B. subtilis coaX 'from its chromosomal locus and replace it with a 
kanamycin resistence gene. 

20 

Detailed Description of the Invention 

The present invention features new and improved methods of producing 
pantothenate and other key compounds of the pantothenate biosynthetic pathway 
(referred to herein as "panto-compounds", for example, pantothenate, ketopantoate, 
25 pantoate and p-alanine) using microorganisms in which the pantothenate biosynthetic 
pathway has been manipulated such that pantothenate or other desired panto-compounds 
are produced. 

The new and improved methodologies of the present invention include methods 
of producing panto-compounds {e.g., pantothenate) in microorganisms having at least 

30 one enzyme of the pantothenate biosynthetic pathway manipulated such that 

pantothenate or other desired panto-compounds are produced {e.g., produced at an 
increased level). For example, the invention features methods of producing panto- 
compounds {e.g., pantothenate) in microorganisms having at least one of ketopantoate 
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase or aspartate- 

35 a-decarboxylase manipulated such that pantothenate or other desired panto-compounds 
are produced. The methodologies of the present invention also include methods of 
producing panto-compounds {e.g., pantothenate) in microorganisms having at least one 
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valine-isoleucine biosynthetic enzyme, described herein, manipulated such that 
pantothenate or other desired panto-compounds are produced. For example, the 
invention features methods of producing panto-compounds (e.g., pantothenate) in 
microorganisms having at least one of acetohydroxyacid synthetase, aeetohydroxyaeid 
5 isomeroreductase or dihydroxyacid dehydratase manipulated such that pantothenate or 
other desired panto-compounds are produced. 

The invention also featuies methods of producing panto-compounds that involve 
culturing a ketopantoate reductase-oveiexpressing (KPAR-O) microorganism under 
conditions such that the panto-compound is produced. The invention also features 
1 0 methods of producing pantothenate in a manner independent of precursor feed that 
. involve culturing an aspartate-a-decarboxylase-overexpressing {AaD-O) microorganism 
under conditions such that pantothenate is produced. Also featured are ^-alanine 
independent high yield pantothenate production methods as well as methods of 
producing P-alanine. The present invention also featuies methods for enhancing 
15 production of panto-compounds that involve culturing pantothenate kinase mutants. In 
particular, the present invention features new and improved methods of producing 
pantothenate and other key compounds of the pantothenate biosynthetic pathway {e.g. , , 
panto-compounds) utilizing microorganisms having modified pantothenate -kinase 
activity, for example, microorganisms that include the coaX gene or that include a 
20 mutant coaATgene, having reduced pantothenate kinase activity. 

In order that the present invention may be more readily understood, -certain terms 
are first defined herein. 

The term "pantothenate biosynthetic pathway" includes the biosynthetic pathway 
involving pantothenate biosynthetic enzymes {e.g. , polypeptides encoded by 
25 biosynthetic enzyme-encoding genes), compounds (e.g. , precursors, substrates, 

intermediates or products), cofactors and the like utilized in the formation or synthesis 
of pantothenate. The term "pantothenate biosynthetic pathway" includes the 
biosynthetic pathway leading to the synthesis of pantothenate in a microorganisms <e.g. , 
in vivo) as well as the biosynthetic pathway leading to the*synthesis of pantothenate in 
30 vitro. Figure 1 includes a schematic representation of the pantothenate biosynthetic 
pathway. Pantothenate biosynthetic enzymes are depicted in bold and their 
corresponding genes indicated in italics. 

The term "pantothenate biosynthetic enzyme" includes any -enzyme utilized in 
the formation of a compound (e.g. , intermediate or product) of the pantothenate 
35 biosynthetic pathway. According to Figure 1, synthesis of pantoate from a- 

ketoisovalerate (a-KTV) proceeds via the intermediate, ketopantoate. Formation of 
ketopantoate is catalyzed by the pantothenate biosynthetic enzyme ketopantoate 



8N900CU} <WO 0124772A3.IA> 



WO 01/021772 



-13- 



PCT/US00/25993 



hydroxymethyltransferase (the panB gene product). Formation of pantoate is catalyzed 
by the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene 
product). Synthesis of P-alanine from aspartate is catalyzed by the pantothenate 
biosynthetic enzyme aspartate-a-decarboxylase (the panD gene product). Fonnation of 
5 pantothenate from pantoate and p-alanine (e.g. , condensation) is catalyzed by the 
pantothenate biosynthetic enzyme pantothenate synthetase (the panC gene product). 

The term "isoleucine-valine biosynthetic pathway" includes the biosynthetic 
pathway involving isoleucine-valine biosynthetic enzymes (e.g., polypeptides encoded 
by biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates, 

1 0 intermediates or products), cofactors and the like utilized in the formation or synthesis 
of conversion of pyruvate to valine or isoleucine. The term "isoleucine-valine 
biosynthetic pathway" includes the biosynthetic pathway leading to the synthesis of 
valine or isoleucine in a microorganisms (e.g., in vivo) as well 'as the biosynthetic 
pathway leading to the synthesis of valine or isoleucine in vitro. Figure 1 includes a 

1 5 schematic representation of the isoleucine-valine biosynthetic pathway. Isoleucine- 
valine biosynthetic enzymes are depicted in bold italics and their corresponding genes 
indicated in italics 

The term "isoleucine-valine biosynthetic enzyme" includes any enzyme utilized 
in the fonnation of a compound (e.g., intermediate or product) of the isoleucine-valine 

20 biosynthetic pathway. According to Figure 1 , synthesis of valine from pyruvate 

proceeds via the intermediates, acetolactate, a,p-dihydroxyisovalerate (a,p-DHIV) and 
a-ketoisovalerate (a-KIV). Formation of acetolactate from pyruvate is catalyzed by the 
isoleucine-valine biosynthetic enzyme acetohydroxyacid synthetase (the itvBN gene 
product, or alternatively, the alsS gene product). Formation of a,p-DfflV from 

25 acetolactate is catalyzed by the isoleucine-valine biosynthetic enzyme 

acetohydroxyacidisomero reductase (the ilvC gene product). Synthesis ofa-KTV from 
a,p-DHIV is catalyzed by the isoleucine-valine biosynthetic enzyme dihydroxyacid 
dehydratase (the ilvD gene product). Moreover, valine and isoleucine can be 
interconverted by branched chain amino acid transaminases. 

30 As used herein, each of ketopantoate, pantoate, P-alanine and pantothenate are 

"panto-compounds". The term "panto-compound" includes a compound (e.g., a 
substrate, intermediate or product) in the pantothenate biosynthetic pathway which is 
downstream from a particular pantothenate biosynthetic enzyme. In one example, a 
panto-compound is downstream of the pantothenate biosynthetic enzyme ketopantoate 

3 5 hydroxymethyltransferase (the panB gene product) and can include ketopantoate, 

pantoate and/or pantothenate. In another example, a panto-compound is downstream of 
the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene product) 
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and can include pantoate and/or pantothenate. In yet anotber -example, a panto- 
compound is downstream of the pantothenate biosynthetic enzyme pantothenate 
synthetase (the panC gene product) and can include pantothenate. In yet another 
' example, a panto-compound is downstream of the pantothenate biosynthetic enzyme 
5 aspartate-a-decarboxylase (the panD gene product) andean include 0-alanine and/or 
pantothenate. 

Preferred panto-compounds include pantothenate and pantoate. The term 
"pantothenate" includes the free acid form of pantothenate, also referred to as 
"pantothenic acid" as well as any salt thereof {e.g. , derived by replacing the acidic , 
1 0 hydrogen of pantothenate or pantothenic acid with acation,-fbr-example, -calcium, 
sodium, potassium, ammonium), also referred to as a "pantothenate salt". The term 
"panto-compound" also includes alcohol derivatives of pantothenate. Preferred 
pantothenate salts are calcium pantothenate or sodium pantothenate. A.preferred 
, alcohol derivative is pantothenol. Pantothenate salts and/or alcohols of die present 
1 5 invention include salts and/or alcohols prepared v/a conventional methods from the fr-ee 
acids described herein. In another embodiment, -calcium pantothenate is synthesized 
directly by a microorganism of the.present invention. A pantothenate salt of the present , 
invention can likewise be converted to a free acid form of pantothenate or pantothenic 
acid by conventional methodology. 
20 The term "pantoate" includes the free acid form of pantoate, also referred to as 

"pantoic acid" as well as any salt thereof (e.g. , derived by replacing the acidic hydrogen 
of pantoate or pantoic acid with a cation, for example, cafcium, sodium, potassium, 
ammonium), also referred to as a "pantoate salt". Preferred pantoate salts a*e calcium 
pantoate or sodium pantoate. Pantoate^alts of the present invention include salts 
25 prepared via conventional methods from the free acids described herein. A pantoate salt 
of the present invention can likewise be converted to a free acid form of pantoate or 
pantoic acid by conventional methodology. Moreover, a free acid form of pantoate or 
pantoic acid can be converted to pantolactone by conventional methodology. 

The term "CoA biosynthetic pathway" includes the biosynthetic pathway 
30 involving CoA biosynthetic enzymes {e.g. , polypeptides encoded by biosynthetic 
enzyme-encoding genes), compounds (e.g., precursors, substrates, intermediates or 
products), cofactors and the like utilized in the formation or synthesis of CoA from 
pantothenate. A schematic representation of the CoA biosynthetic pathway in E. coli is 
set forth as Figure 1 6. (The pathway depicted is also presumed to be that utilized by 
35 other microorganisms.) The term "CoA biosynthetic pathway" includes the biosynthetic 
pathway leading to the synthesis of CoA in microorganisms (e.g., in vivo) as well as the 
biosynthetic pathway leading to the synthesis of CoA in vitro. The term "Coenzyme A 
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or Co A biosynthetic enzyme" includes any enzyme utilized in the formation of a 
compound (e.g., intermediate or product) of the CoA biosynthetic pathway, for example, 
the coaA,panK or coaX gene product which catalyzes the phosphorylation of 
pantothenate to form 4'-phosphopantothenate, or the coaD gene product which catalyzes 
5 the conversion of 4'-phosphopantetheine to dephosphocoenzyme A. 

I Recombinant Microorganisms and Methods for Culturing 
Microorganisms Such That Panto-Compounds are Produced 

The methodologies of the present invention feature microorganisms, e.g., 

1 0 recombinant microorganisms, preferably including vectors or genes (e.g. , wild-type 
and/or mutated genes) as described herein and/or cultured in a manner which results in 
the production of a desired product (e.g. a panto-compound or panto-compounds). The 
term "recombinant" microorganism includes a microorganism 1 ^. g. 9 bacteria, yeast cell, 
fiingal cell, etc.) which has been genetically altered, modified or engineered (e.g., 

1 5 genetically engineered) such that it exhibits an altered, modified or different genotype 
and/or phenotype (e.g., when the genetic modification affects coding nucleic acid 
sequences of the microorganism) as compared to the naturally-occurring microorganism 
from which it was derived. Preferably, a "recombinant" microorganism of the present 
invention has been genetically engineered such that it overexpresses at least one 

20 bacterial gene or gene product (e.g. , a pantothenate or isoleucine-valine biosynthetic 

enzyme encoding-gene) as described herein, preferably a biosynthetic enzyme encoding- 
gene included within a recombinant vector as described herein and/or a biosynthetic 
enzyme expressed from a recombinant vector. The ordinary skilled will appreciate that 
a microorganism expressing or overexpressing a gene product produces or overproduces 

25 the gene product as a result of expression or overexpression of nucleic acid sequences 
and/or genes encoding the gene product. 

The term "manipulated microorganism" includes a microorganism that has been 
engineered (e.g., genetically engineered) or modified such that the microorganism has at 
least one enzyme of the pantothenate biosynthetic pathway and/or at least one enzyme of 

30 the isoleucine-valine biosynthetic pathway modified such that pantothenate or other 
desired panto-compounds are produced. Modification or engineering of such 
microorganisms can be according to any methodology described herein including, but 
not limited to, deregulation of a biosynthetic pathway and/or overexpression of at least 
one biosynthetic enzyme. A "manipulated" enzyme (e.g., a "manipulated" biosynthetic 

35 enzyme) includes an enzyme, the expression or production of which has been altered or 
modified such that at least one upstream or downstream precursor, substrate or product 
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of the enzyme is altered or modified, for -example, as compared to a corresponding wiW- 
type or naturally occurring enzyme. 

The term "overexpressed" or "overexpression" includes expression of a,gene 
product {e.g. , a pantothenate biosynthetic enzyme or isoteucine-valine biosynthetic 
5 enzyme) at a level greater than that-expressed prior to manipulation x>f the 

microorganism or in a comparable microorganism which has not been manipulated. In 
one embodiment, the microorganism can be genetically manipulated (e.g. , genetically 
engineered) to overexpress a level of gene product greater than that^xpressed prior to 
manipulation of the microorganism or in a comparable microorganism which has not , 
10 been manipulated. Genetic manipulation can include, but is not limited to, altering or 
modifying regulatory sequences or sites associated with expression Of a particular gene 
(e.g. , by adding strong promoters, inducible promoters or multiple promoters or by 
removing regulatory sequences such that-expression is constitutive), modifying tiie 
-chromosomal location of a particular gene, altering nucleic acid sequences adjacent to a 
1 5 particular gene such as a ribosome binding site or transcription terminator, increasing 
the copy number of a particular gene, modify mg^roteins (e.g., regulatory proteins, 
suppressors, enhancers, transcriptional activators and the like) involved in transcription 
of a particular gene and/or translation of a particular gene product, or any other 
conventional means of deregulating expression of a particular-gene routine in the art 
20 (including but not limited to use of antisense nucleic acid molecules, for example, to 
block expression of repressor proteins). 

In another embodiment, the microorganism can be physically or environmentally 
manipulated to overexpress a level of gene product greater than that expressed prior to 
manipulation of the microorganism or in a comparable microorganism which has not 
25 been manipulated. For example, a microorganism can be treated with orcultured in the 
presence of an agent known or suspected to increase transcription of a particular gene 
and/or translation of a particular gene product such that transcription and/or translation 
are enhanced or increased. Alternatively, a microorganism can be cultured at a 
temperature selected to increase transcription of a particular gene and/or translation of a 
30 particular gene product such that transcription and/or translation are enhanced or 
increased. 

The term "deregulated" or "deregulation" includes the alteration or modification 
of at least one gene in a microorganism that encodes an enzyme in a biosynthetic 
pathway, such that the level or activity of the biosynthetic enzyme in the microorganism 
35 is altered or modified. Preferably, at least one gene that encodes an enzyme in a 

biosynthetic pathway is altered or modified such that the gene product is enhanced or 
increased. The phrase "deregulated pathway"«an also include a biosynthetic-pathway in 
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which more than one gene that encodes an enzyme in a biosynthetic pathway is altered 
or modified such that the level or activity of more than one biosynthetic en2yme is 
altered or modified. The ability to "deregulate" a pathway {e.g., to simultaneously 
deregulate more than one gene in a given biosynthetic pathway) in a microorganism 
5 arises from the particular phenomenon of microorganisms in which more than one 
enzyme {e.g., two or three biosynthetic enzymes) are encoded by genes occurring 
adjacent to one another on a contiguous piece of genetic material termed an "operon". 

The term "operon" includes a coordinated unit of gene expression that contains a 
promoter and possibly a regulatory element associated with one or more, preferably at 

1 0 least two, structural genes (e. g. , genes encoding enzymes, for example, biosynthetic 
enzymes). Expression of the structural genes can be coordinately regulated, for 
example, by regulatory proteins binding to the regulatory element or by anti-termination 
of transcription. The structural genes can be transcribed to givte a single mRNA that • 
encodes all of the structural proteins. Due to the coordinated regulation of genes 

1 5 included in an operon, alteration or modification of the single promoter and/or 

regulatory element can result in alteration or modification of each gene product encoded 
by the operon. Alteration or modification of the regulatory element can include, but is 
not limited to removing the endogenous promoter and/or regulatory element(s), adding 
strong promoters, inducible promoters or multiple promoters or removing regulatory 

20 sequences such that expression of the gene products is modified, modifying the 
chromosomal location of the operon, altering nucleic acid sequences adjacent to the 
operon or within the operon such as a ribosome binding site, increasing the copy number 
of the operon, modifying proteins {e.g., regulatory proteins, suppressors, enhancers, 
transcriptional activators and the like) involved in transcription of the operon and/or 

25 translation of the gene products of the operon, or any other conventional means of 
deregulating expression of genes routine in the art (including but not limited to use of 
antisense nucleic acid molecules, for example, to block expression of repressor 
proteins). Deregulation can also involve altering the coding region of one or more genes 
to yield, for example, an enzyme that is feedback resistant or has a higher or lower 

30 specific activity. 

A particularly preferred "recombinant' * microorganism of the present invention 
has been genetically engineered to overexpress a bacterially-derived gene or gene 
product. The term "bacterially-derived" or "derived-from" for example bacteria, 
includes a gene which is naturally found in bacteria or a gene product {e.g., ketopantoate 

35 hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase, aspartate-a- 
decarboxylate, acetohydroxyacid synthetase, acetohydroxyacid isomeroieductase or 
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dihydroxyacid dehydratase) which is encoded by a bacterial gene<*.g., encoded by 
panB, panE, panC, panD, ilvB, ilvN, alsS, ilvC, or ilvD). 

The methodologies of the present invention feature recombinant microorganisms 
which overexpress at least one of ketopantoate hydroxymethyltransferase, ketopantoate 
5 reductase, pantothenate synthetase or aspartate-a-decaiboxylase. A particularly 
preferred recombinant microorganism of the present invention has been genetically 
engineered to overexpress a Bacillus {e.g., Bacillus Hchenifoitnis, Bacillus 
amyloliquefaciens, Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus, etc.) 
biosynthetic enzyme (e.g. , has been engineered to overexpress at least one of A subtilis 
1 0 ketopantoate reductase (the panE gene product) {e.g. , ketopantoate reductase having the 
amino acid sequence of SEQ ID NO:30 or encoded by the nucleic acid sequence of SEQ 
ID NO:29), B. subtilis ketopantoate hydroxymethyltransferase<the panBgene product) 
{e.g., ketopantoate hydroxymethyltransferase having the amino acid sequence of SEQ 
ID NO:24 or encoded by a nucleic acid molecule having the nucleotide sequence of 
15 SEQ ID NO:23), B. subtilis pantothenate synthetase<the panC-gene product) <e,g., 

pantothenate synthetase having the amino acid sequence of SEQ ID NO:26 or encoded . 
by a nucleic acid molecule having die nucleotide sequence of SEQ ID NO:25) and/or B: 
subtilis aspartate-a-decarboxylase {the panD gene product) (e.g., aspartate-a- 
decarboxylase having the amino acid sequence of SEQ ID NO:28 or encoded by a 
20 nucleic acid molecule having the nucleotide sequence of SEQ ID NO:27). 

In an exemplary embodiment, the invention features a microorganism £e. g. , a 
KPAR-0 microorganism) that has been transformed with a vector comprising apanE 
nucleic acid sequence (e.g., a panE nucleic acid seqeunce as set forth in SEQ ID 
NO:29). In another embodiment, the invention features a microorganism that has been 
25 transformed with a vector comprising a panB nucleic acid sequence(eg., a panB nudeic 
acid sequence as set forth in SEQ ID NO:23), a vector comprising apanC nucleic acid 
sequence (e.g., a panC nucleic acid sequence as set forth in SEQfl) NO:25) or a vector 
comprising a panD nucleic acid sequence (eg. , a panD nucleic acid sequence as set 
forth in SEQ ID NO:27). In yet another embodiment, the invention features a 
30 microorganism having a deregulated panBCD operon {e.g. , SEQ ID NO:59). 

Other preferred "recombinant" microorganisms of the present invention have a 
deregulated isoleucine-valine (ilv) pathway. The phrase "microorganism having a 
deregulated isoleucine-valine (ilv) pathway" includes a microorganism having an 
alteration or modification in at least one gene encoding an enzyme of the isoleucine- 
35 valine (ilv) pathway or having an alteration or modification in an operon including more 
than one gene encoding an enzyme of the isoleucine-valine (i/v) pathway. A preferred 
"microorganism having a<ieregulated isoleucine-valine (ilv) pathway" has been 
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genetically engineered to overexpress a Bacillus (e.g., B. subtilis) ilv biosynthetic 
enzyme (e.g., has been engineered to overexpress at least, one of acetohydroxyacid 
synthetase (the ilvBN gene products or the alsS gene product) (e.g., acetohydroxyacid 
synthetase having subunits having the amino acid sequences of SEQ ID NO:32 and SEQ 
5 ID NO:34 or encoded by nucleic acid molecules having the nucleotide sequence of SEQ • 
ID NO:3 1 and SEQ ID NO:33 or the nucleotide sequence of SEQ ID NO:58 from 
nucleotides 1-2246 or acetohydroxyacid synthetase encoded by a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:86), acetohydroxyacid isomeroreductase 
(the ilvC gene product) (e.g., acetohydroxyacid isomeroreductase having the amino acid 

1 0 sequence of SEQ ID NO:36 or encoded by a nucleic acid molecule having the 

nucleotide sequence of SEQ ID NO:35), dihydroxyacid dehydratase (the ilvD gene 
product) (e.g. , dihydroxyacid dehydratase having the amino acid sequence of SEQ ID 
NO:38 or encoded by a nucleic acid molecule having the nucleotide sequence of SEQ 
ID NO:37), and/or has been transformed with a vector comprising an ilvBNC nucleic 

15 , acid sequence (SEQ ID NO:58, coding regions from nucleotides 1-1725, 1722-2246 and 
2263-3291) and/or an ilvD nucleic acid sequence (SEQ ID NO:37). 

In another preferred embodiment, a recombinant microorganism is designed or 
engineered such that a mutant CoaA and/or CoaX biosynthetic enzyme is expressed and . 
at least one pantothenate biosynthetic enzyme and/or at least one isoleucine-valine 

20 biosynthetic enzyme is overexpressed or deregulated. 

In another preferred embodiment, a microorganism of the present invention 
overexpresses or is mutated for a gene or biosynthetic enzyme {e.g., a CoA biosynthetic 
enzyme, pantothenate biosynthetic enzyme or isoleucine-valine biosynthetic enzyme) 
which is bacterially-derived. The term "bacterially-derived" or "derived-ftt>m", for 

25 example bacteria, includes a gene product (e.g. , ketopantoate hydroxymethyltransferase, 
ketopantoate reductase, pantothenate synthetase, aspartate-a-decarboxylate, 
acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase, dihydroxyacid 
dehydratase or pantothenate kinase) which is encoded by a bacterial gene (e.g.,pcmB, 
panE,panC y panD, ilvBN (or alsS), i/vC, ilvD, or encoded by coaA or coaX). 

30 Still other preferred recombinant microorganisms of the present invention are 

mutant microorganisms. As used herein, the term "mutant microorganism" includes a 
recombinant microorganism that has been genetically engineered to express a mutated 
gene or protein that is normally or naturally expressed by the microorganism. 
Preferably, a mutant microorganism expresses a mutated gene or protein such that the 

35 microorganism exhibits an altered, modified or different phenotype (e.g., has been 

engineered to express a mutated CoaA biosynthetic enzyme, for example, pantothenate 
kinase). In one embodiment, a mutant microorganism is designed or engineered such 
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that it includes a mutant coaX gene, as defined herein. In another embodiment, a 
recombinant microorganism is designed or engineered such that it includes a mutant 
coaA gene, as definedherein. In another^mbodiment, a mutant microorganism is ( 
designed or engineered such that a coaAgene has been deleted ij.e. , the protein encoded 
5 by the coaX gene is not produced). In another embodiment, a mutant microorganism is 
designed or engineered such that a coaA gene has been deleted (z.e. , the protein encoded 
by the coaA gene is not produced). Preferably, a mutant microorganism has a mutant 
coaX gene or a mutant coaA -gene, or has been engineered to have a -coaX gene and/or 
coaA deleted, such that that the mutant microorganism encodes a "reduced -pantothenate 
1 0 kinase activity". In the context of a whole microorganism, a "reduced pantothenate 
kinase activity" can be determined by measuring or assaying for a decrease in an 
intermediate or product of the Co A biosynthetic pathway, for example, measuring or 
assaying for 4'-phosphopantothenate, 4 , -phosphopantothenylcysteine, 4'- 
phosphopantetheine, dephosphocoenzyme A, Coenzyme A, apo-acyl carrier protein 
1 5 (apo-ACP) or holo-acyl earner protein<ACP) in the microorganism {e.g. , in a lysate 
isolated or derived from the microorganism) or in the medium in which the 
microorganism is cultured (see e.g., Figure 16). Alternatively, a "reduced pantothenate ' 
kinase activity" can be determined by measuring or assaying for decreased growth of the 
microorganism. Alternatively, a "reduced pantothenate -kinase activity" can be 
20 determined by measuring or assaying for an increase in a panto-compound<(e.g. , 
pantothenate) in the microorganism or surrounding media, as panto-compounds lie 
upstream of the CoA biosynthetic pathway, the first step of which is catalyzed by 
pantothenate kinase. The invention also features recombinant microorganisms that, in 
addition to having reduced pantothenate kinase activity {e.g., expressing mutant coaA 
25 and/or mutant coaX genes) have a deregulated pantothenate biosynthesis pathway and/or 
a deregulated isoleucine-valine (z7v) biosynthetic pathway. 

In one embodiment, a recombinant microorganism of the present invention is a 
Gram positive organism (e.g., a microorganism which retains basic dye, for example, 
crystal violet, due to the presence of atjram-positive wall surrounding the 
30 microorganism). In a preferred embodiment, the recombinant microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci and Streptomyces. In a more preferred 
embodiment, the recombinant microorganism is of the genus Bacillus. In another 
preferred embodiment, the recombinant microorganism is selected from the group 
35 consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, Bacillus firmus, 
Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus -cereus, Bacillus circukms, 
Bacillus coagulans, Bacillus licheniformxs, Bacillus megqterium, Bacillus pumilus, 
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Bacillas thuringietms, and other Group 1 Bacillus species, for example, as characterized 
by 16S rRNA type (Priest (1993) in Bacillus subtilis andOther Gram-Positive Bacteria 
eds. Sonenshein et ah, ASM, Washington, D.C., p. 6). In another preferred 
embodiment, the recombinant microorganism is Bacillus brevis or Bacillus 
5 stearothermophilus. In another preferred embodiment, the recombinant microorganism 
is selected from the group consisting of Bacillus licheniformis, Bacillus 
amyloliquefaciens, Bacillus halodurans, Bacillus subtilis, and Bacillus pwnilus. 

In another embodiment, the recombinant microorganism is a Gram negative 
(excludes basic dye) organism. In a preferred embodiment, the recombinant 

10 microorganism is a microorganism belonging to a genus selected from the group 
consisting of Salmonella, Escherichia, Klebsiella, Serratia, and Proteus. In a more 
preferred embodiment, the recombinant microorganism is of the genus Escherichia. In 
an even more preferred embodiment, the recombinant microorganism is Escherichia 
coli. In another embodiment, the recombinant microorganism is Saccharomyces (e.g., S. 

15 . cerevisiae). 

An important aspect of the present invention involves culturing the recombinant 
microorganisms described herein, such that a desired compound (e.g., a desired panto- 
compound) is produced. The term "culturing" includes maintaining and/or growing a 
living microorganism of the present invention (e.g., maintaining and/or growing a 

20 culture or strain). In one embodiment, a microorganism of the invention is cultured in 
liquid media. In another embodiment, a microorganism of the invention is cultured in 
solid media or semi-solid media. In a preferred embodiment, a microorganism of the 
invention is cultured in media (e.g., a sterile, liquid media) comprising nutrients 
essential or beneficial to the maintenance and/or growth of the microorganism (e.g., 

25 carbon sources or carbon substrate, for example complex carbohydrates such as bean or 
grain meal, starches, sugars, sugar alcohols, hydrocarbons, oils, fats, fatty acids, organic 
acids and alcohols; nitrogen sources, for example, vegetable proteins, peptones, peptides 
and amino acids derived from grains, beans and tubers, proteins, peptides and amino 
acids derived form animal sources such as meat, milk and animal byproducts such as 

30 peptones, meat extracts and casein hydrolysates; inorganic nitrogen sources such as 
urea, ammonium sulfate, ammonium chloride, ammonium nitrate and ammonium 
phosphate; phosphorus sources, for example, phosphoric acid, sodium and potassium 
salts thereof; trace elements, for example, magnesium, iron, manganese, calcium, 
copper, zinc, boron, molybdenum, and/or cobalt salts; as well as growth factors such as 

35 amino acids, vitamins, growth promoters and the like). 
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Preferably, microorganisms t>f the present invention are •cultured under 
controlled pH. The term "controlled pH" includes any pH which results in production of 
the desired product {e.g. , a panto-compound). In one embodiment, microorganisms are 
cultured at a pH of about 7. In another embodiment, microorganisms are cultured at a 
5 pH of between 6.0 and 8.5. The desired pH may be maintained by any number of 
methods known to those skilled in the art. 

Also preferably, microorganisms of the present invention are cultured under 
controlled aeration. The term "controlled aeration" includes sufficient aeration (e.g., 
oxygen) to result in production of the desired product {e.g., panto-compound). In one 
1 0 embodiment, aeration is controlled by regulating oxygen levels in the culture, for 
example, by regulating the amount of oxygen dissolved in culture media. Preferably, 
aeration of the culture is controlled by agitating the culture. Agitation may be provided 
by a propeller or similar mechanical agitation equipment, by revolving or shaking the 
growth vessel {e.g., fermentor) or by various pumping equipment. Aeration may be 
1 5 further controlled by the passage of sterile air or oxygen through the medium (e.g., 
through the fermentation mixture). Also preferably, microorganisms of the present 
invention are cultured without excess foaming (e.g., via addition of antifoaming agents). 

Moreover, microorganisms of the present invention «can be cultured under 
controlled temperatures. The term "controlled temperature" includes any temperature 
20 which results in production of the desired jproduct <€.#., a panto-compound). In one 
embodiment, controlled temperatures include temperatures between 1 5°C and 95°C. in 
another embodiment, controlled temperatures include temperatures between 15°C and 
70°C. Preferred temperatures are between 20°C and 55°C, more preferably between 
30°C and 45°C or between 30°C and 50°C. 
25 Microorganisms can be cultured {e.g. , maintained and/or grown) in liquid media 

and preferably are cultured, either continuously or intermittently, by conventional 
culturing methods such as standing culture, test tube culture, shaking culture {e.g. , rotary 
shaking culture, shake flask culture, etc.), aeration spinner-culture, or fermentation. In a 
preferred embodiment, the microorganisms are cultured in shake flasks. In a more 
30 preferred embodiment, the microorganisms are cultured in a fermentor (e.g. , a 

fermentation process). Fermentation processes of the present invention include, but are 
not limited to, batch, fed-batch and continuous methods of fermentation. The phrase 
"batch process" or "batch fermentation" refers to a closed system in which the 
composition of media, nutrients, supplemental additives and the like is -set at the 
35 beginning of the fermentation and nofsubject to alteration during the fermentation, 

however, attempts may be made tocontroi such factors as pH and oxygen -concentration 
to prevent excess media acidification and/or microorganism death. The phrase "fed- 
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batch process" or "fed-batch" fermentation refers to a batch fermentation with the 
exception that one or more substrates or supplements are added {e.g., added in 
increments or continuously) as the fermentation progresses. The phrase "continuous 
process" or "continuous fermentation" refers to a system in which a defined 
5 fermentation media is added continuously to a fermentor and an equal amount of used or 
"conditioned" media is simultaneously removed, preferably for recovery of the desired 
product (e.g. , panto-compound). A variety of such processes have been developed and 
are well-known in the art. 

The phrase "culturing under conditions such that a desired compound (e.g., a 

1 0 panto-compound, for example, pantothenate) is produced" includes maintaining and/or 
growing microorganisms under conditions (e.g., temperature, pressure, pH, duration, 
etc.) appropriate or sufficient to obtain production of the desired compound or to obtain 
desired yields of the particular compound being produced. For example, culturing is 
continued for a time sufficient to produce the desired amount, of a panto-compound (e.g., 

15 pantothenate, pantoate or p-alanine). Preferably, culturing is continued for a time 
sufficient to substantially reach maximal production of the panto-compound. In one 
embodiment, culturing is continued for about 12 to 24 hours. In another embodiment, 
culturing is continued for about 24 to 36 hours, 36 to 48 hours; 48 to 72 hours, 72 to 96 
hours, 96 to 120 hours, 120 to 144 hours, or greater than 144 hours. In another 

20 embodiment, culturing is continued for a time sufficient to reach production yields of 
panto-compound, for example, cells are cultured such that at least about 15 to 20 g/L of 
panto-compound are produced, at least about 20 to 25 g/L panto-compound are 
produced, at least about 25 to 30 g/L panto-compound are produced, at least about 30 to 
35 g/L panto-compound are produced, at least about 35 to 40 g/L panto-compound are 

25 produced (e.g. , at least about 37 g/L panto-compound) or at least about 40 to 50 g/L 
panto compound are produced. In yet another embodiment, microorganisms are 
cultured under conditions such that a preferred yield of panto-compound, for example, a 
yield within a range set forth above, is produced in about 24 hours, in about 36 hours, in 
about 48 hours, in about 72 hours, or in about 96 hours. 

30 The methodology of the present invention can further include a step of 

recovering a desired compound (e.g., a panto-compound). The term "recovering" a 
desired compound (e.g., a panto-compound) includes extracting, harvesting, isolating or 
purifying the compound from culture media. Recovering the compound can be 
performed according to any conventional isolation or purification methodology known 

35 in the art including, but not limited to, treatment with a conventional resin (e.g. , anion or 
cation exchange resin, non-ionic adsoiption resin, etc.), treatment with a conventional 
adsorbent (e.g., activated charcoal, silicic acid, silica gel, cellulose, alumina, etc.), 
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alteration ofpH, solvent extraction-(e.g., with a conventional solvent such as an alcohol, 
ethyl acetate, hexane and the like), dialysis, filtration, concentration, crystallization, 
recrystallization, pH adjustment, lyophiiization and the like. For example, a compound 
(e.g. , a panto-compound) can be recovered from culture media by first removing the 
5 microorganisms from the culture. Media is then passed through or over a cation 

exchange resin to remove unwanted cations and then through or over an anion exchange 
resin to remove unwanted inorganic anions and organic acids having -stronger acidities 
than the panto-compound of interest (eg., pantothenate). The resulting panto-compound 
(e.g., pantothenate) can subsequently be converted to a pantothenate salt (e.g., calcium 
10 pantothenate) as described herein. 

Preferably, a desired compound of the present invention is "extracted", 
"isolated" or "purified" such that the resulting preparation is substantially free of other 
components (e.g., free of media components and/or fermentation byproducts). The 
language "substantially free of other components" includes preparations of desired 
1 5 compound in which the compound is separated {e.g. , purified or partially purified) from 
media components or fermentation byproducts of the culture from which it is produced. 
In one embodiment, the preparation has greater than about 80%<by dry weight) of the 
desired compound (e.g. , less than about 20% of other media-components or fermentation 
byproducts), more preferably greater than about 90% of the desired^compound (e.g., less 
20 than about 1 0% of other media components or fermentation byproducts), still more 

preferably greater than about 95% of the desired compound (e.g., less than about 5% of 
other media components or fermentation byproducts), and most preferably greater than 
about 98-99% desired compound (e.g., less than about 1-2% other media components or 
fermentation byproducts). When the desired compound is a panto-compound that has 
25 been derivatized to a salt (e.g. a pantothenate salt or pantoate salt), the panto^compound 
is preferably further free {e.g. , substantially free) of chemical contaminants associated 
with the formation of the salt. When the desired compound is a panto-compound that 
has been derivatized to an alcohol, the panto-compound is preferably further free (e.g., 
substantially free) of chemical contaminants associated with the formation of the 
30 alcohol. 

In an alternative embodiment, the desired panto-compound is not purified from 
the microorganism, for example, when the microorganism is biologically non-hazardous 
(e.g., safe). For example, the entire culture (or culture supernatant) can be used as a 
source of product (e.g. , crude product). In one embodiment, the culture <or culture 
35 supernatant) supernatant is used without modification. In another embodiment, the 
culture (or culture supernatant) is concentrated. In yet another embodiment, the culture 
(or culture supernatant) is<iried or lyophilized. 
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II. Panto-Compound Production Methodologies Featuring Ketopantoate 
Reductase-Overexpressing Microorganisms 

One aspect of the invention features methods of producing a panto-compounds 
that involve culturing a ketopantoate reductase-overexpressing (KPAR-O) 
5 microorganism under conditions such that the panto-compound is produced The term ' 
"ketopantoate reductase-overexpressing (KPAR-O) microorganism" includes a 
microorganism which has been manipulated such that ketopantoate reductase is 
overexpressed (e.g., a B. subtilis ketopantoate reductase protein having the amino acid 
sequence of SEQ ID NO:30) and/or has been transformed with a vector comprising a 

1 0 panEl nucleic acid sequence {e.g. , a B. subtilis panEl nucleic acid sequence as set forth 
in SEQ ID NO:29). In one embodiment, the panto-compound is pantothenate. In 
another embodiment, the panto-compound is pantoate. In another embodiment, the 
ketopantoate reductase is bacterial-derived. In another embodiemnt, the ketopantoate 
reductase is derived from Bacillus {e.g., is derived from Baqllus subtilis). In yet 

15 another embodiment, the KPAR-O microorganism is Gram positive. In yet another 
embodiment, the KPAR-O microorganism is a microorganism belonging to a genus 
selected from the group consisting of Bacillus, Cornyebacterium, Lactobacillus, 
Lactococci and Streptomyces. In a preferred embodiemnt, the KPAR-O microorganism * 
is of the genus Bacillus. In a more preferred embodiment, the KPAR-O microorganism ' 

20 is selected from the group consisting of Bacillus licheniformis, Bacillus 

amyloliquefaciens, Bacillus halodurans, Bacillus subtilis and Bacillus pumilus. In a, 
particularly preferred embodiemnt, the KPAR-O microorganism is Bacillus subtilis. 

In still other embodiments, the KPAR-O microorganism further overexpresses at 
least one pantothenate biosynthetic enzyme in addition to ketopantoate reductase. In an 

25 exemplary embodiment, the KPAR-O microorganism further overexpresses at least one 
of ketopantoate hydroxymethyltransferase, pantothenate synthetase and aspartate-a- 
decarboxylase. Also featured are methods of producing panto-compounds, for example, 
methods that involve culturing a KPAR-O microorganism, which further include the 
step of recovering the panto-compound. 

30 

III Methods of Producing Panto-Compounds Independent of Precursor Feed 
Requirements 

Depending on the biosynthetic enzyme or combination of biosynthetic enzymes 
manipulated, it may be desirable or necessary to provide {e.g., feed) microorganisms of 
35 the present invention at least one pantothenate biosynthetic precursor such that 

pantothenate or other desired panto-compounds are produced. The term "pantothenate 
biosynthetic precursor" or "precursor" includes an agent or compound which, when 



WO 01/021772 



PCT/US00/25993 



-26- 

provided to, brought into contact with, or included in the culture medium of a 
microorganism, serves to enhance or increase pantothenate biosynthesis. In one 
embodiment, the pantothenate biosynthetic precursor or precursor is aspartate. In 
another embodiment, the pantothenate biosynthetic precursor or precursor is ^-alanine. 
5 The amount of aspartate or p-alanine added is preferably an amount that results in a 
concentration in the culture medium sufficient to enhance productivity of the 
microorganism (e.g., a concentration sufficient to enhance production of a panto- 
compound, for example, P-alanine, ketopantoate, pantoate or pantothenate). 
Pantothenate biosynthetic precursors of the present invention can be added in the form 
10 of a concentrated solution or suspension (e.g. , in asui table solvent such as water or 
buffer) or in the form of a solid (e.g., in the form of a powder). Moreover, pantothenate 
biosynthetic precursors of the present invention can be added afc a single aliquot, . 
continuously or intermittently over a given period of time. 

In yet another embodiment, the pantothenate biosynthetic precursor is valine, see 
1 5 e.g. , Example III. In yet another embodiment, tide pantothenate biosynthetic precursor is 
a-ketoisovalerate. Preferably, valine or a-ketoisovalerate is added in an amount that 
results in a concentration in the medium sufficient for production of the desired product . 
(e.g., panto-compound) to occur. Pantothenate biosynthetic precursors are also referred 
to herein as "supplemental pantothenate biosynthetic substrates". 
20 Providing pantothenate biosynthetic precursors in the pantothenate biosynthetic 

methodologies of the present invention, can be associated with high costs, for example, 
when the methodologies are used to produce high yields of panto-compounds. 
Accordingly, preferred methodologies of the present invention feature microorganisms 
having at least one biosynthetic enzyme or combination of biosynthetic enzymes-(e.g., at 
25 least one pantothenate biosynthetic enzyme and/or valine-isoieucine biosynthetic 
enzyme) manipulated such that pantothenate or other desired panto-compounds are 
produced in a manner independent of precursor feed. The phrase "a manner 
independent of precursor feed", for example, when referring to a method for producing a 
desired compound (e.g., a panto-compound), includes an approach to or a mode of 
30 producing the desired compound that does not depend or rely on precursors being 
provided (e.g., fed) to the microorganism being utilized to produce the desired 
compound. For example, microorganisms featured in the methodologies of the present 
invention can be used to produce panto-compounds in a manner requiring no feeding of 
the precursors aspartate, P-alanine, valine and/or a-KIV. 
35 Alternative preferred methodologies of the present invention feature 

microorganisms having at least one biosynthetic enzyme or -combination of biosynthetic 
enzymes manipulated such that pantothenate or other desired panto-compounds are 
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produced in a manner substantially independent of precursor feed. The phrase "a 
manner substantially independent of precursor feed" includes an approach to or a 
method of producing the desired compound that depends or relies to a lesser extent on 
( precursors being provided (e.g., fed) to the microorganism being utilized. For example, 
5 microorganisms featured in the methodologies of the present invention can be used to 
produce panto-compounds in a manner requiring feeding of substantially reduced 
amounts of the precursors aspartate, p-alanine, valine and/or a-KIV. In one 
embodiment, the invention features methods of producing panto-compounds (e.g., 
pantothenate) in a manner that requires feeding of less than 5%-10% of the amount of 

10 precursor required by a control microorganism (e.g. , a microorganism that is dependent, 
for example is wholly dependent, on precursor feed to efficiently produce the desired 
compound). In another embodiment, the invention features methods of producing 
panto-compounds in a manner that requires feeding of less than 15-20% of the amount 
of precursor required by a control microorganism. In another embodiment, the 

1 5 invention features methods of producing panto-compounds in a manner that requires 
feeding of less than 25-30%, 35-40%, 45-50% or 55-60% of the amount of precursor 
required by a control microorganism. As described in Examples I-HI herein, particular 
microorganisms featured in the methodologies of the present invention require, for 
example, 5 g/L of aspartate, 0-alanine, valine or a-KIV (e.g., in test tube or in shake 

20 flask cultures). Accordingly, in a preferred embodiment, the present invention features 
methods of producing panto-compounds (e.g. , pantothenate) in a manner requiring 
feeding of less than 0.25 g/L, 0.5 g/L, 0.75 g/L, 1 g/L, 1.25 g/L, 1.5 g/L, 1.75 g/L, 2 g/L, 
2.25 g/L, 2.5 g/L, 2.75 g/L or 3 g/L. 

Preferred methods of producing desired compounds (e.g., panto-compounds) in a 

25 manner independent of precursor feed or alternatively, in a manner substantially 
independent of precursor feed, involve culturing microorganisms which have been 
manipulated (e.g., designed or engineered, for example, genetically engineered) such 
that expression of at least one pantothenate biosynthetic enzyme, and/or at least one 
isoleucine-valine biosynthetic enzyme, is modified. For example, in one embodiment, a 

30 microorganism is manipulated (e.g. , designed or engineered) such that the production of 
at least one pantothenate biosynthetic enzyme, and/or at least one isoleucine/valine 
biosynthetic enzyme is deregulated. In a preferred embodiment, a microorganism is 
manipulated (e.g., designed or engineered) such that it has a deregulated biosynthetic 
pathway, for example, a deregulated pantothenate biosynthesis pathway and/or a 

35 deregulated isoleucine-valine biosynthetic pathway, as defined herein. In another 

preferred embodiment, a microorganism is manipulated (e.g., designed or engineered) 
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such that at least one pantothenate biosynthetic enzyme, ajid/or at least one isoteueine- 
valine biosynthetic enzyme is overexpressed. 

Preferred methods of producing desired compounds {e.g. , panto-compounds) in a 
' manner independent of precursor feed or alternatively, in a manner substantially 
5 independent of precursor feed, are as follows. In one embodiment, the invention 

features a method of producing pantothenate in a manner independent of precursor feed 
comprising culturing an aspartate-a-decarboxylase-overexpressing (AaD-O) 
microorganism having a deregulated isoleucine-valine (Hv) pathway under ^conditions 
such that pantothenate is produced. In another embodiment, the invention features a 
10 method of producing pantothenate in a manner independent of precursor feed 
comprising culturing an aspaitate-a-decarboxy lase-overexpressing < AaD-O) 
microorganism having a deregulated pantothenate (pan) pathway and a deregulated 
isoleucine-valine (i/v) pathway, under conditions such that pantothenate is produced. In 
, another embodiment, the invention features a method of producing pantothenate in a 
1 5 manner independent of aspartate or 0-alanine feed comprising culturing an aspartate-a- 
decarboxy lase-overexpressing t AaD-O) microorganism under conditions such that 
pantothenate is produced. In yet another embodiment, the invention features a method . 
of producing pantothenate in a manner independent of valine or a-ketoiso valerate feed 
comprising culturing a microorganism having a deregulated isoleucine-valine (ilv) 
20 biosynthetic pathway under conditions such that pantothenate is produced. 

The term "aspartate-a-decarboxylase-overexpressing {AaD-O) microorganism" 
includes a microorganism which has been manipulated such that aspartate-a- 
decarboxylase is overexpressed. A preferred "aspartate-a-decarboxy lase-overexpressing 
(AaD-O) microorganism" has been transformed with a vector comprising a 5. subtilis 
25 panD nucleic acid sequence (e.g., a panD nucleic acid sequence that encodes an 

aspartate-a-decarboxylase protein having the amino acid sequence of SEQ ID NO:28, 
for example, a panD nucleic acid sequence as set forth in SEQ ID NO:27). 

The phrase "microorganism having a deregulated isoleucine-valine</7v) 
pathway" includes a microorganism having an alteration or modification in at least one 
30 gene encoding an enzyme of the isoleucine-valine (/7v) pathway or having an alteration 
or modification in an operon including more than one gene encoding an enzyme of the 
isoleucine-valine (ilv) pathway. A preferred "microorganism having a deregulated 
isoleucine-valine (ilv) pathway" overexpresses acetohydroxyacid synthetase {e.g., 
acetohydroxyacid synthetase having subunits having the amino acid sequences of SEQ 
35 ID NO:32 and SEQ ID NO:34 or acetohydroxyacid synthetase having the amino acid 
sequence of SEQ ID NO: 87), acetohydroxyacid isomeroreductase {having the amino 
acid sequence of SEQ ID NO:36), or dihydroxyacid dehydratase {having the amino acid 
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sequence of SEQ ID NO:38) and/or has been transformed with a vector comprising ilvB, 
ilvN, i/vC, ilvBN, ilvBNC or alsS nucleic acid sequences (SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:35, nucleotides 1-2246 of SEQ ID NO:58, SEQ ID NO:58 having 
coding regions from nucleotides 1-1725, 1722-2246 and 2263-3291, or SEQ ID NO:86, 
5 respectively) and/or an ilvD nucleic acid sequence (SEQ ID NO:37). 

TV. High Yield Production Methodologies 

A particularly preferred embodiment of the present invention is a high yield 
production method for producing pantothenate comprising culturing a manipulated 

10 microorganism under conditions such that pantothenate is produced at a significantly 
high yield. The phrase "high yield production method", for example, a high yield 
production method for producing a desired compound (e.g., for producing a panto- 
compound) includes a method that results in production of the (iesired compound at a 
level which is elevated or above what is usual for comparable production methods. 

1 5 Preferably, a high yield production method results in production of the desired 

compound at a significantly high yield. The phrase "significantly high yield" includes a 
level of production or yield which is sufficiently elevated or above what is usual for 
comparable production methods, for example, which is elevated to a level sufficient for 
commercial production of the desired product (e.g. , production of the product at a 

20 commercially feasible cost). In one embodiment, the invention features a high yield 
production method of producing pantothenate that includes culturing a manipulated 
microorganism under conditions such that pantothenate is produced at a level greater 
than 2 g/L. In another embodiment, the invention features a high yield production 
method of producing pantothenate that includes culturing a manipulated microorganism 

25 under conditions such that pantothenate is produced at a level greater than 1 0 g/L. In 
another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 20 g/L. In yet 
another embodiment, the invention features a high yield production method of 

30 producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 30 g/L. In yet 
another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 40 g/L. 

35 The invention further features a high yield production method for producing a 

desired compound (e.g., for producing a panto-compound) that involves culturing a 
manipulated microorganism under conditions such that a sufficiently elevated level of 
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compound is produced within a commercially desireable period of time. In an 
exemplary embodiment, die invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
'conditions such that pantothenate is produced at a level greater than 15-20 g/L in 36 . 

5 hours. In another embodiment, the invention features a high yield production method-of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 25-30 g/L in 48 
hours. In another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under , 

10 conditions such that pantothenate is produced at aieveLgreater than 35-40-g/L in 72 
, hours, for example, greater that 37 g/L in 72 hours. In another embodiment, the 
invention features a high yield production method of producing pantothenate that 
includes culturing a manipulated microorganism under conditions such that pantothenate 
is produced at a level greater than 30-40 g/L in 60 hours, for example, greater that 30, 35 

15 or 40 g/L in 60 hours. Values and ranges included and/or intermediate within the ranges * 
set forth herein are also intended to be within the scope of the present invention. For 
' example, pantothenate production at levels of at least 3 1 , 32, 33, 34, 35, 36, 37, 38 and . 
39 g/L in 60 hours are intended to be included within the range of 30-40 g/L in 60 horns. 
In another example, ranges of 30-35 g/L or 35-40 g/L are intended to be included within 

20 the range of 30-40 g/L in 60 hours. Moreover, the skilled artisan will appreciate that 
culturing a manipulated microorganism to achieve a production level of, for example, 
"30-40 g/L in 60 hours" includes culturing the microorganism for additional time 
periods (e.g., time periods longer than 60 hours), optionally resulting in even higher 
yields of pantothenate being produced. 

25 

V. Panto-Compound Production Methodologies Featuring Pantothenate 
Kinase Mutant Microorganisms 

The present invention relates to methods of producing pantothenate using 
microorganisms engineered to produce high yields of pantothenate as well as otter 
30 panto-compounds. Cells overproducing pantothenate result in high intracellular 

pantothenate levels that could overcome the feedback inhibition of pantothenate kinase 
by Co A, leading to overproduction of Co A. Besides -consuming pantothenate, increased 
synthesis of Co A may cause increased feedback inhibition of the PanB, PanD, PanE or 
PanC reaction, thereby limiting pantothenate production. Accordingly, a reduction in 
35 pantothenate kinase activity may lead to a decrease in Co A levels with resulting 
increases in PanB, PanD, PanE or PanC activity and pantothenate production. 
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Thus, certain methodologies of the present invention are based, at least in part, 
on the identification and characterization of the B. subtilis coaA gene and the 
demonstration that the gene is neither essential for B. subtilis growth (z.e., deletion of the 
coaA gene from the chromosome of B. subtilis is not a lethal event) nor for pantothenate 
5 kinase activity in B. subtilis. A second pantothenate kinase-encoding gene has been 
identified and characterized in B. subtilis, and is termed "cocOC\ This gene 
complements an E. coli mutant that contains a temperature sensitive pantothenate kinase 
and is not related by homology to any previously known pantothenate kinase gene. 
In one aspect, the methodologies of the invention feature recombinant 
1 0 microorganisms that include the coaX gene or that include a mutant coaX gene, having 
reduced pantothenate kinase activity. In one embodiment, the methodologies feature 
such recombinant microorganisms further having a deregulated pantothenate 
biosynthetic pathway. In another embodiment, the methodologies feature such 
recombinant microorganisms further having a deregulated isoleucine-valine (Jlv) 
15 pathway. In a preferred embodiment, the microorganisms belong to the genus Bacillus ' 
(e.g,B. subtilis). 

The methodologies of the invention also feature recombinant microorganisms 
(e.g., microorganisms belong to the genus Bacillus, for example, B. subtilis) that 
include the coaA gene or that include a mutant coaA gene, optionally including a coaX 
20 gene or mutant thereof, having reduced pantothenate kinase activity. In one 

embodiment, the methodologies feature such recombinant microorganisms further 
having a deregulated pantothenate biosynthetic pathway or having a deregulated 
isoleucine-valine {ilv) pathway. Also featured are vectors that include isolated coaX or 
coaA genes as well as mutant coaX and/or coaA genes. Isolated nucleic acid molecules 
25 that include isolated coaX genes or mutant coaX genes are features in addition to 
isolated CoaX proteins and mutant CoaX proteins. 

The above-described nucleic acid molecules (e.g., genes), proteins, vectors, and 
recombinant microorganisms (e.g., mutant microorganisms), are particularly suited for 
use in methods of producing panto-compounds and/or methods of enhancing panto- 
30 compound production. In one embodiment, the invention features a method for 

producing a panto-compound (e.g., pantothenate) that includes culturing a pantothenate 
kinase mutant (e.g., a recombinant microorganism that misexpresses, e.g., is mutated 
for, pantothenate kinase, as defined herein) under conditions such that panto-compound 
is produced. In another embodiment, the invention features a method for enhancing 
35 production of a panto-compound (e.g. , pantothenate) that includes culturing a 

pantothenate kinase mutant (e.g., a recombinant microorganism that misexpresses, e.g., 
is mutated for, pantothenate kinase, as defined herein) under conditions such that 
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production of the panto-compound is produced. As used herein, the term "enhancing" 
(for example, in the context of the phrase "enhancing production") includes increasing 
the level or rate of production of panto-compound (e.g., pantothenate) as compared to 
'the level or rate of production in a non-mutant microorganism (e. g. , a microorganism 
5 having a normal pantothenate kinase gene(s) and/or having normal pantothenate 
production rates and/or levels. 

Preferably, the level of panto-compound produced in methodologies featuring 
the pantothenate kinase mutants of the present invention is increased by at least 5% as 
compared to the level produced by a non-mutant (e.g., a recombinant microorganism. 
1 0 expressing non-mutated pantothenate kinase). Even more preferably , die level of panto- 
, compound is increased 1 0% as compared to methodologies featuring non-mutants. 
Even more preferably, panto-compound levels (e.g., pantothenate levels) are increased 
20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, are increased 2-fold, 5-foid, 10-fold, 50- 
fold, 100-fold or more as compared to methodologies featuring non-mutants. 

15 

VI. Additional Mutations Resulting in Enhanced Panto-Compound 
Production . . . , 

The methodologies of the present invention further can include, for example in 
addition to overexpressing or deregulating a pantothenate biosynthetic enzyme and/or an 
20 isoleucine-valine biosynthetic enzyme, or in addition to mutating a pantothenate-kinase 
encoding gene, deleting or mutating an enzyme that catalyzes the-conversion of key 
pantothenate biosynthesis substrates or precursors to unwanted or undesirable products. 
For example, mutating the ilvE gene (Kuramitsu et al (1985) 1 Biochem. 97:993-999) 
or a homologue thereof (SEQ ID N0^2 or SEQ ID NO:64), thereby limiting the 
25 conversion of a-ketoisovalerate to valine, in addition to mutating a pantothenate kinase 
encoding enzyme, is predicted to result in even further enhanced or increased production 
of panto-compound. Alternatively, mutating the ansB gene (Sun andSeflow^l 99 1 > J. 
Bacteriol 173:3831-3845) or a homologue thereof (SEQ ID NO:66), thereby limiting 
the degradation of aspartate, in addition to mutating a pantothenate kinase encoding 
30 enzyme, is predicted to result in even further enhanced or increased production of panto- 
compound. Alternatively, mutating the afoZ?:gene{R«enna et al {1993) J. Bacteriol. 
175:3863-3875) or a homologue thereof {SEQ ID NO:68), thereby limiting the 
conversion of acetolactate to acetoin, in addition to mutating a pantothenate kinase 
encoding enzyme, is predicted to result in even further enhanced or increased production 
35 of panto-compound. Alternatively, mutating the avtA gene ^encoding alanine-valine 
transaminase or a homologue thereof, thereby limiting the conversion of a- 
ketoisovalerate to valine, in addition to mutating a pantothenate kinase encoding 
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enzyme, is predicted to result in even further enhanced or increased production of panto- 
compound. Mutating the avtA gene can include mutating, for example, an avtA gene 
having the nucleotide sequence of SEQ ID NO:70 (e.g., the E. coli avtA gene), or a 
structural homolog thereof (e.g., a homologue encoding a protein having 30-40%, 40- 
5 50%, 50-60%, 60-70%, 70-80%, 80-90%, 90-95% or more identity with the amino acid 
sequence of SEQ ID NO:71) or a functional homologue (e.g., a gene encoding a 
structurally unrelated protein having alanine-valine transmainase activity. Such 
mutations can be accomplished using the methodologies as exemplified in the Examples 
(e.g., Examples XIII, XV, XVI and XVII). 

1 0 Accordingly, in one embodiment, the invention features a method of producing a 

panto-compound which includes culturing a microorganism having a mutant 
pantothenate kinase-encoding gene and which further has a deletion or mutation in an 
avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. In another embodiment, the 
invention features a method of producing a panto-compound which includes culturing a 

1 5 microorganism having a mutant pantothenate-kinase encoding gene and a deregulated 
pantothenate biosynthetic pathway enzyme and which further has a deletion or mutation 
in an avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. In another embodiment, 
the invention features a method of producing a panto-compound which includes 
culturing a microorganism having a mutant pantothenate-kinase encoding gene and a 

20 deregulated isoleucine-valine biosynthetic pathway enzyme and which further has a 
deletion or mutation in an avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. 

Mutating the alsD gene can be particularly useful when accomplished in 
conjunction with overexpression or deregulation of the alsS gene, for example, to 
prevent carbon (e.g. , acetolactate) from being drawn away from the precursor pool 

25 utilized for ct-KIV production. Accordingly, to maximize the contribution of the als 
locus to panto-compound production, it is desirable to disrupt the alsD gene in addition 
to overexpressing the alsS gene. To disrupt the alsD gene, appropriate fragments of the 
als operon, flanking the alsD gene, are amplified by PCR and cloned to provide 
homology for creating the disruptions. A drug resistance gene, such as the cat gene, is 

30 cloned between the flanking DNA fragments in place of the alsD gene, and the 

linearized DNA is transformed into a pantothenate production strain such as PA824, 
selecting for drug-resistance. To overexpress alsS, the alsS coding sequence (e.g., an 
alsS coding sequence that has been engineered by PCR for expression) is cloned into an 
expression vector. Vectors which express alsS (or alternatively, vectors which express 

35 alsS plus UvQ are the introduced into panto-compound production strains {e.g. , the 
pantothenate producing strain PA824). 
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The methodologies of the present invention further can include, for example in 
addition to overexpressing or deregulating a pantothenate biosynthetic enzyme and/or an 
isoleucine-valine biosynthetic enzyme, or in addition to mutating a pantothenate-kinase 
'encoding gene, deleting or mutating an enzyme that catalyzes the conversion of desired 
5 panto-compounds to unwanted or undesireable downstream products. 

VII Isolated Nucleic Acid Molecules and Genes 

Another aspect of the present invention features isolated nucleic acid molecules 
that encode Bacillus proteins (e.g. , B. subtilis proteins), for example, Bacillus 
1 0 pantothenate biosynthetic enzymes {e.g. , B. subtilis pantothenate biosyntibeticenzymes) 
. or Bacillus valine-isoleucine biosynthetic enzymes {e.g. , B. subtilis valine-isoleucine 
biosynthetic enzymes). Also featured are isolated coaX and/or coaA nucleic acid 
molecules (e.g., isolated coaX mdJoxcoaA genes) as well as isolated nucleic acid 
molecules that include such coaX and/or coaA nucleic acid molecules or genes. 
1 5 The term "nucleic acid molecule" includes DNA molecules {e.g. , iinear,-circular, f 

cDNA or chromosomal DNA) and RNA molecules. {e.g. , tiRNA, rRNA, mRNA) and 
analogs of the DNA or RNA generated using nucleotide analogs. The nucleic acid 
molecule can be single-stranded or double-stranded, but preferably is double--stranded 
DNA. The term "isolated" nucleic acid molecule includes a nucleic acid molecule 
20 which is free of sequences which naturally flank the nucleic acid molecule {L e., 

sequences located at the 5 1 and 3' ends of the nucleic acid molecule) in the chromosomal 
DNA of the organism from which the nucleic acid is derived. In various embodiments, 
an isolated nucleic acid molecule can contain less than about 10 kb, 5 kb, 4kb, 3kb, 2kb, 
1 kb, 0.5 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences which naturally flank 
25 the nucleic acid molecule in chromosomal DNA of the microorganism from which the 
nucleic acid molecule is derived. Moreover, an "isolated" nucleic acid molecule, such 
as a cDNA molecule, can be substantially free -of other cellular materials when produced 
by recombinant techniques, or substantially free of chemical precursors or other 
chemicals when chemically synthesized. 
30 The term "gene", as used herein, includes a nucleic acid molecule {e.g. , a DNA 

molecule or segment thereof), forexample, a protein or RNA-encoding nucleic acid 
molecule, that in an organism, is separated from another gene or other genes, by 
intergenic DNA (i.e., intervening or spacer DNA which naturally flanks the gene and/or 
separates genes in the chromosomal DNA of the organism). A gene may direct 
35 synthesis of an enzyme or other protein molecule {e.g. , may comprise coding sequences, 
for example, a contiguous open reading frame (ORF) which encodes a protein) or may 
itself be functional in the organism. A gene in an organism, may be clustered in an 
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operon, as defined herein, said operon being separated from other genes and/or operons 
by the intergenic DNA. Individual genes contained within an operon may overlap 
without intergenic DNA between said individual genes. An "isolated gene", as used 
herein, includes a geke which is essentially free of sequences which naturally flank the 

5 gene in the chromosomal DNA of the organism from which the gene is derived (Le. 9 is 
free of adjacent coding sequences which encode a second or distinct protein or RNA 
molecule, adjacent structural sequences or the like) and optionally includes 5 f and 3 1 
regulatory sequences, for example promoter sequences and/or terminator sequences. In 
one embodiment, an isolated gene includes predominantly coding sequences for a 

10 protein (e.g., sequences which encode Bacillus proteins). In another embodiment, an 
isolated gene includes coding sequences for a protein (eg., for a Bacillus protein) and 
adjacent 5' and/or 3' regulatory sequences from the chromosomal DNA of the organism 
from which the gene is derived (e.g., adjacent 5' and/or 3' Bacillus regulatory 
sequences). Preferably, an isolated gene contains less than about 10 kb, 5 kb, 2 kb, 1 kb, 

15 0.5 kb, 0.2 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences which naturally 
flank the gene in the chromosomal DNA of the organism from which the gene is 
derived. 

In one aspect, the present invention features isolated panB nucleic acid 
sequences or genes, isolated panC nucleic acid sequences or genes, isolated panD 

20 nucleic acid sequences or genes, isolated panE nucleic acid sequences or genes, isolated 
HvB, ilvN, ilvBN nucleic acid sequences or genes, isolated alsS nucleic acid sequences or 
genes, isolated ilvC nucleic acid sequences or genes and/or isolated ilvD nucleic acid 
sequences or genes. 

In a preferred embodiment, the nucleic acid or gene is derived from Bacillus 

25 (e.g., is fiac/7/us-derived). The term "derived from Bacillus" or "5acz7/t«-derived" 

includes a nucleic acid or gene which is naturally found in microorganisms of the genus 
Bacillus. Preferably, the nucleic acid or gene is derived from a microorganism selected 
from the group consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, 
Bacillus firmus, Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, 

30 Bacillus circulates, Bacillus coagulans, Bacillus licheniformis, Bacillus megateriwn, 
Bacillus pumilus, Bacillus thuringiensis, and other Group 1 Bacillus species, for 
example, as characterized by 16S rRNA type (Priest, supra). In another preferred 
embodiment, the nucleic acid or gene is derived from Bacillus brevis or Bacillus 
stearothermophilus. In another preferred embodiment, the nucleic acid molecules 

35 and/or genes of the present invention are derived from a microorganism selected from 
the group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens, Bacillus 
halodurans, Bacillus subtilis, and Bacillus pumilus. In a particularly preferred 
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embodiment, the nucleic acid or gene is derived fiom Bacillus subtilis {e.g. , is Bacillus 
sw&rt/is-derived). The term "derived from Bacillus subtilis 9 ' or ^Bacillus subtilis- 
derived" includes a nucleic acid or -gene which is naturally found in Bacillus subtilis. In 
1 yet another preferred embodiment, the nucleic acid or gene is a Bacillus gene 

5 homologue {e.g. , is derived from a species distinct fiom Bacillus but having significant 
homology to a Bacillus gene of die present invention, for example, a Bacillus pan gene 
or Bacillus ilv gene). 

Included within the scope of the present invention are bacterial-derived nucleic 
acid molecules or genes and/or ItoriZ/i/s-derived nucleic acid molecules or genes {e.g. , 

10 B. subtilis-tenved nucleic acid molecules or genes), for example, the genes identified by 
the present inventors, for example, Bacillus or B. subtilis -coaJfgenes, -coaA genes, pan 
genes and/or ilv genes. Further included within the scope of the present invention are 
bacterial-derived nucleic acid molecules or genes and/or Bacillus-derived nucleic acid 
molecules or genes {e.g., B. subtilis-derived nucleic acid molecules or genes) (e.g. 9 B. 

1 5 subtilis nucleic acid molecules or genes) which differ from naturally-occurring bacterial 
and/or Bacillus nucleic acid molecules or genes {e.g., B. subtilis nucleic acid molecules 
or genes), for example, nucleic acid molecules or genes which have nucleic acids that . 
are substituted, inserted or deleted, but which encode proteins substantially similar to the 
naturally-occurring gene products of the present invention. In one -embodiment, an 

20 isolated nucleic acid molecule comprises at least one of the nucleotide .sequences set , 
forth as SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27,SEQ IDNO:29, SEQ ID NO 
31, SEQ ID NO:33, SEQ ID NO:86, SEQ ID NO:35 or SEQ ID NO:37. In another 
preferred embodiment, an isolated nucleic acid molecule comprises at least two, three or 
four of the nucleotide sequences set forth as SEQ ID NO:23, SEQ ID NO:25,SEQ ID 

25 NO:27, SEQ ID NO:29, SEQ ID NO 3 1 , SEQ ID NO:33, SEQ ID NO:88, SEQ ID 
NO:35 or SEQ ID NO:37. For example, a preferred isolated nucleic acid molecule of 
the present invention can include the nucleotide sequences of SEQ ID NO:23, 1SEQ ID 
NO:25 and SEQ ID NO:27, preferably linked such that the proteins -encoded by the 
nucleotide sequences of SEQ ID NO:23, SEQ ID NO:25 and SEQ ID NO:27 are-each 

30 produced when the isolated nucleic acid molecule is expressed in a microorganism {e. g. , 
SEQ ID NO:59). In another example, a preferred isolated nucleic acid molecule of the 
present invention can include the nucleotide sequences of SEQ ID NO:3 1 andUEQID 
NO:33, preferably linked such that the proteins encoded by the nucleotide sequences of 
SEQ ED NO:31 and SEQ ID NO:33 are each produced when the isolated nucleic acid 

3 5 molecule is expressed in a microorganism {e.g. , nucleotides 1 -2246 of SEQ ID NO :5 8) . 
In another example, a preferred isolated nucleic acid molecule of the present invention 
can include the nucleotide sequence of SEQ ID NO:86. In another example, a preferred 
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isolated nucleic acid molecule of the present invention can include the nucleotide 
sequences of SEQ ID NO:31, SEQ ID NO:33 and SEQ ID NO:35, preferably linked 
such that the proteins encoded by the nucleotide sequences of SEQ ID NO:31, SEQ ID 
NO:33 and SEQ ID NO:35 are each produced when the isolated nucleic acid molecule is 
5 expressed in a microorganism (e.g. , SEQ ID NO:58). 

In another embodiment, an isolated nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least about 60-65%, preferably at 
least about 70-75%, more preferable at least about 80-85%, and even more preferably at 
least about 90-95% or more identical to a nucleotide sequence set forth as SEQ ID 
10 NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31 , SEQ ID 

NO:33, SEQ ID NO:88, SEQ ID NO:35 or SEQ ID NO:37. In another embodiment, an 
isolated nucleic acid molecule hybridizes under stringent conditions to a nucleic acid 
molecule having a nucleotide sequence set forth as SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ ID NO:88, SEQ 
1 5 ID NO:35 or SEQ ID NO:37. Such stringent conditions are known to mose skilled in 
the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent (e.g. high 
stringency) hybridization conditions are hybridization in 6X sodium chloride/sodium 
citrate (SSC) at about 45°C, followed by one or more washes in 0.2 X SSC, 0.1% SDS 
20 at 50-65°C. Preferably, an isolated nucleic acid molecule of the invention that 
hybridizes under stringent conditions to the sequence of SEQ ID NO:23, SEQ ID 
NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ ID 
NO:88, SEQ ID NO:35 or SEQ ID NO:37 corresponds to a naturally-occurring nucleic 
acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to 
25 an RNA or DNA molecule having a nucleotide sequence that occurs in nature. 

A nucleic acid molecule of the present invention (e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, SEQ ID NO 31, SEQ ED NO:33, SEQ ID NO:88, SEQ ID NO:35 or 
SEQ ID N0.37 can be isolated using standard molecular biology techniques and the 
30 sequence information provided herein. For example, nucleic acid molecules can be 
isolated using standard hybridization and cloning techniques (e.g., as described in 
Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 
2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) or can be isolated by the polymerase chain reaction using 
35 synthetic oligonucleotide primers designed based upon the sequence of SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ED NO:29, SEQ ID NO 3 1, SEQ ED NO:33, SEQ 
ED NO:88, SEQ ED NO:35 or SEQ ED NO:37. A nucleic acid of the invention can be 



PCT7US00/25993 . 1 

i 



WO 01/021772 



PCT/US00/25993 



-38- 

amplified using cDNA, mRNA or alternatively, genomic DNA, as a template and 
appropriate oligonucleotide primers according to standard PCR amplification 
techniques. In another preferred embodiment, an isolated nucleic acid molecule of the 
1 invention comprises a nucleic acid molecule which is a complement of the nucleotide 
5 sequence shown in SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, 
SEQ ID NO:33, SEQ ID NO 31, SEQ ID NO:33,SEQ ID NO:88, SEQ ID NO:35. 

Additional panC nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:25, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID ND:26<eg., encode a polypeptide having 
1 0 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
. having the amino acid sequence as set forth in SEQ ID NO:26 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:25 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
1 5 amino acid sequence of SEQ ID NO:26, or are complementary to SLpanC nucleotide 
sequence as set forth herein. 

Aditional panD nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:27, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:28 <e.g., encode a polypeptide having 
20 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:28 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:2:7 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
25 amino acid sequence of SEQ ID NO:28, or are complementary to a panD nucleotide 
sequence as set forth herein. 

Additional panE nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:29, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:30 (e.g., encode a polypeptide having 
30 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:30 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:29 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
35 amino acid sequence of SEQ ID NO:30, or are complementary to a panE nucleotide 
sequence as set forth herein. 
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Additional ilvB nucleic acid sequences are those that comprise the nucleotide 
sequence of SEQ ID NO:31, encode a homologue of the polypeptide having the amino 
acid sequence set forth in SEQ ED NO:32 {e.g. , encode a polypeptide having at least 
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide having 

5 the amino acid sequence as set forth in SEQ ID NO:32 and a substantially identical 
activity as said polypeptide), hybridize under stringent conditions to all or a portion of a 
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:31 or to all or a 
portion of a nucleic acid molecule that encodes a polypeptide having the amino acid 
sequence of SEQ ID NO:32, or are complementary to an ilvB nucleotide sequence as set 

10 forth herein. 

Additional ilvN nucleic acid sequences are those that comprise the nucleotide 
sequence of SEQ ID NO:33, encode a homologue of the polypeptide having the amino 
acid sequence set forth in SEQ ID NO:34 (e.g., encode a polypeptide having at least 
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide having 
1 5 the amino acid sequence as set forth in SEQ ID NO:34 and a substantially identical 

activity as said polypeptide), hybridize under stringent conditions to all or a portion of a 
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:33 or to all or a 
portion of a nucleic acid molecule that encodes a polypeptide having the amino acid 
sequence of SEQ ID NO:34, or are complementary to an ilvN nucleotide sequence as set 

20 forth herein. 

Additional UvC nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:35, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:36 (e.g. , encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 

25 having the amino acid sequence as set forth in SEQ ID NO:36 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:35 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:36, or are complementary to an ilvC nucleotide 

30 sequence as set forth herein. 

Additional ilvD nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:37, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:38 (e.g., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 

35 having the amino acid sequence as set forth in SEQ ID NO:38 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or 
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to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:38, or are complementary to an ilvD nucleotide 
sequence as set forth herein. 

Additional alsS nucleic acid sequences include those mat comprise the 
5 nucleotide sequence of SEQ ID NO:86, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:87<e,g, encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:87 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
1 0 portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:86 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:87, or are complementary to an alsS nucleotide 
sequence as set forth herein. 

In another embodiment, an isolated nucleic acid molecule is or includes a coaX 
1 5 gene, or portion or fragment thereof. In one embodiment, an isolated coaX nucleic acid 
molecule or gene comprises the nucleotide sequence as set forth in SEQ ID NO:19 (e.g., 
comprises the B. subtilis coaX nucleotide sequence). In another embodiment, an 
isolated coaX nucleic acid molecule or gene comprises a nucleotide sequence that 
encodes the amino acid sequence as set forth in SEQ ID NO:9 (e.g., encodes the B. 
20 subtilis CoaX amino acid sequence). In yet another embodiment, an isolated coaX 
nucleic acid molecule or gene encodes a homologue of the CoaX protein having the 
amino acid sequence of SEQ ID NO:9. As used herein, the term "homologue" includes 
a protein or polypeptide sharing at least about 30-35%, preferably at least about 35-40%, 
more preferably at least about 40-50%, and even more preferably at least about 60%, 
25 70%, 80%, 90% or more identity with the amino acid sequence of a wild-type protein or 
polypeptide described herein and having a substantially equivalent functional or 
biological activity as said wild-type protein or polypeptide. For example, a CoaX 
homologue shares at least about 30-35%, preferably at least about 35-40%, more 
preferably at least about 40-50%, and even more preferably at least about 60%, 70%, 
30 80%, 90% or more identity with the protein having the amino acid sequence set forth as 
SEQ ED NO:9 and has a substantially equivalent functional or biological activity (i.e., is 
a functional equivalent) of the protein having the amino acid sequence set forth as SEQ 
ID NO:9 (e.g. , has a substantially equivalent pantothenate kinase activity). In a 
preferred embodiment, an isolated coaX nucleic acid molecule or gene comprises a 
35 nucleotide sequence that encodes a polypeptide as set forth in any one of SEQ ID NO:7, 
SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ED NO:13,SEQ 
ID NO:14, SEQ ID NO: 15, SEQ DD NO:16, SEQ ID NO:17, SEQ ID NO:18,SEQ ID 
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NO:74 or SEQ ID N0.75. In another embodiment, an isolated coaATnucleic acid 
molecule hybridizes to all or a portion of a nucleic acid molecule having the nucleotide 
sequence set forth in SEQ ID NO: 19 or hybridizes to all or a portion of a nucleic acid 
molecule having a nucleotide sequence that encodes a polypeptide having the amino 
5 ' acid sequence of any of SEQ ID NOs:7-l 8, 74 or 75. Such hybridization conditions are . 
known to those skilled in the art and can be found in Current Protocols in Molecular 
Biology, Ausubel et al. , eds., John Wiley & Sons, Inc. (1 995), sections 2, 4 and 6. 
Additional stringent conditions can be found in Molecular Cloning: A Laboratory 
Manual, Sambrook et al, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), 
1 0 chapters 7, 9 and 1 1 . A preferred, non-limiting example of stringent hybridization . 
conditions includes hybridization in 4X sodium chloride/sodium citrate (SSC), at about 
65-70°C (or hybridization in 4X SSC plus 50% formamide at about 42-50°C) followed 
by one or more washes in IX SSC, at about 65-70°C. A preferred, non-Umiting 
example of highly stringent hybridization conditions includes hybridization in IX SSC, 
15 at about 65-70°C <or hybridization in IX SSC plus 50% formamide at about 42-50°C) 
followed by one or more washes in 0.3X SSC, at about 65-70°C. A preferred, non- 
limiting example of reduced stringency hybridization conditions includes hybridization 
in 4X SSC, at about 50-60°C (or alternatively hybridization in 6X SSC plus 50% 
formamide at about 40-45°C) followed by one or more washes in 2X SSC, at about 50- 
20 60°C. Ranges intermediate to the above-recited values, e.g. , at 65-70°C or at 42-50°C 
are also intended to be encompassed by the present invention. SSPE (IX SSPE is 0.15 
M NaCl, lOmM NaH 2 P0 4 , and 1.25 mM EDTA pH 7.4) can be substituted for SSC (IX 
SSC is 0.1 5 M NaCl and 15 mM sodium citrate) in the hybridization and wash buffers; 
washes are performed for 1 5 minutes each after hybridization is complete. The 
25 hybridization temperature for hybrids anticipated to be less than 50 base pairs in length 
should be 5-10°C less than the melting temperature (T m ) of the hybrid, where T m is 
determined according to the following equations. For hybrids less than 18 base pairs in 
length, T m (°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 1 8 and 
49 base pairs in length, T^'C) = 81.5 + ^(logjotNa 4 ]) + 0.41(%G+C) - (600/N), 
30 where N is the number of bases in the hybrid, and [Na*] is the concentration of sodium 
ions in the hybridization buffer ({Na*] for IX SSC = 0.165 M). It will also be 
recognized by the skilled practitioner that additional reagents may be added to 
hybridization and/or wash buffers to decrease non-specific hybridization of nucleic acid 
molecules to membranes, for example, nitrocellulose or nylon membranes, including but 
35 not limited to blocking agents (e.g. , BS A or salmon or herring sperm carrier DN A), 
detergents (e.g., SDS), chelating agents (e.g., EDTA), Ficoll, PVP and the like. When 
using nylon membranes, in particular, an additional preferred, non-limiting example of 
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stringent hybridization conditions is hybridization in 0.25-O.5M NaHfePO^ 7% SDS at 
about 65°C, followed by one or more washes atO.02M NaH 2 P0 4 , 1% SDS at 65°C, see 
e.g., Church and GUbert (1984) Proc. Natl Acad, Sci. USA 81:1991-1995, (or, 
alternatively, 0.2X SSC, 1% SDS). In another preferred embodiment, an isolated 
5 nucleic acid molecule comprises a nucleotide sequence that is complementary to a coaX 
nucleotide sequence as set forth herein (e.g. , is the full complement of the nucleotide 
sequence set forth as SEQ ID NO: 1 9). 

In another preferred embodiment, an isolated nucleic acid molecule is or 
includes a coaA gene, for example, a Bacillus (e.g., B. subtilis) coaA gene, or portion or 
1 0 fragment thereof. Exemplary isolated coaA nucleic acid molecules and/or genes include 
(1) an isolated coaA nucleic acid molecule or gene comprising the nucleotide sequence 
as set forth in any one of SEQ ID NOs:20-22; (2) an isolated codA nucleic acid molecule 
or gene comprising a nucleotide sequence that-encodes the amino acid sequence as set 
forth in SEQ ID NO:3; (3) an isolated coaA nucleic acid molecule or gene comprising a 
1 5 nucleotide sequence which encodes a CoaA homologue {e.g. , a polypeptide having an 
amino acid sequence at least about 30-35%, preferably at least about 35-40%, more 
preferably at least about 40-50P/o, and even more preferably at least about 60%, 70%, . 
80%, 90% or more identical to the amino acid sequence set forth as SEQ ID NO:3 and 
having a substantially equivalent enzymatic activity; (4) an isolated coaA nucleic acid 
20 molecule or gene comprising a nucleotide sequence that encodes a polypeptide as set 
forth in any one of SEQ ID NO:l , SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5 or SEQ 
ID NO:6; (5) an isolated nucleic acid molecule that hybridizes under stringent 
conditions to all or a portion of a nucleic acid molecule having the nucleotide sequence 
set forth in SEQ ID NO:20, SEQ ID NO:21 or SEQ ID NO:22 or hybridizes to all or a 
25 portion of a nucleic acid molecule having a nucleotide sequence that encodes a 
polypeptide having the amino acid sequence of SEQ ID NO:3; and (6) an isolated 
nucleic acid molecule comprising a nucleotide sequence that is complementary to a 
coaA nucleotide sequence as set forth herein (e.g. , is the full complement of the 
nucleotide sequence set forth in SEQ ID NO:20, SEQ ID NO:21 or SEQ ID NO:22). 
30 A nucleic acid molecule of the present invention (e.g. , a coaX nucleic acid 

molecule or gene or a coaA nucleic acid molecule or gene), can be isolated using 
standard molecular biology techniques and the sequence information provided herein. 
For example, nucleic acid molecules can be isolated using standard hybridization and 
cloning techniques (e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, T. 
35 Molecular Cloning: A Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Haibor, NY, 1989) or can be isolated 
by the polymerase chain reaction using synthetic oligonucleotide primers designed 
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based upon the cooXot coaA nucleotide sequences set forth herein, or flanking 
sequences thereof. A nucleic acid of the invention (e.g. , a coaX nucleic acid molecule or 
gene or a coaA nucleic acid molecule or gene), can be amplified using cDNA, mRNA or 
alternatively, chromosomal DNA, as a template and appropriate oligonucleotide primers 

5 according to standard PCR amplification techniques. 

Yet another embodiment of the present invention features mutant coaXand coaA 
nucleic acid molecules or genes. The phrase "mutant nucleic acid molecule" or "mutant 
gene" as used herein, includes a nucleic acid molecule or gene having a nucleotide 
sequence which includes at least one alteration (e.g., substitution, insertion, deletion) 

1 0 such that the polypeptide or protein that may be encoded by said mutant exhibits an' 
activity that differs from the polypeptide or protein encoded by the wild-type nucleic 
acid molecule or gene. Preferably, a mutant nucleic acid molecule or mutant gene (e.g. , 
a mutant coaA or coo^fgene) encodes a polypeptide or protein having a reduced activity 
(e.g., having a reduced pantothenate kinase activity) as compared to the polypeptide or 

15 1 protein encoded by the wild-type nucleic acid molecule or gene, for example, when 
assayed under similar conditions (e.g. , assayed in microorganisms cultured at the same 
temperature). A mutant gene also can encode no polypeptide or have a reduced level of 
production of the wild-type polypeptide. 

As used herein, a "reduced activity" or "reduced enzymatic activity" is one that 

20 is at least 5% less than that of the polypeptide or protein encoded by the wild-type 
nucleic acid molecule or gene, preferably at least 5-10% less, more preferably at least 
1 0-25% less and even more preferably at least 25-50%, 50-75% or 75-100% less than 
that of the polypeptide or protein encoded by the wild-type nucleic acid molecule or 
gene. Ranges intermediate to the above-recited values, e.g., 75-85%, 85-90%, 90-95%, 

25 are also intended to be encompassed by the present invention. As used herein, a 
"reduced activity" or "reduced enzymatic activity" also includes an activity that has 
been deleted or "knocked out" (e.g., approximately 100% less activity than that of the 
polypeptide or protein encoded by the wild-type nucleic acid molecule or gene). 
Activity can be determined according to any well accepted assay for measuring activity 

30 of a particular protein of interest. Activity can be measured or assayed directly, for 
example, measuring an activity of a protein isolated or purified from a cell. 
Alternatively, an activity can be measured or assayed within a cell or in an extracellular 
medium. For example, assaying for a mutant coaA gene or a mutant coaX gene {i.e., 
said mutant encoding a reduced pantothenate kinase activity) can be accomplished by 

35 expressing the mutated gene in a microorganism, for example, a mutant microorganism 
which expresses pantothenate kinase in a temperature-sensitive manner, assaying the 
mutant gene for the ability to complement a temperature sensitive (Ts) mutant for 
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pantothenate kinase activity. A coaX mutant gene or coaA mutant gene that -encodes a 
"reduced pantothenate kinase activity" is one that complements the Ts mutant less 
effectively than, for example, a corresponding wild-type coaJf-gene or coaA gene. 

It will be appreciated by the skilled artisan that even a single substitution in a 

5 nucleic acid or gene sequence (e.g. , a base substitution that encodes an amino acid 

change in the corresponding amino acid sequence) can dramatically affect the activity of 
an encoded polypeptide or protein as compared to the corresponding wild-type 
polypeptide or protein. A mutant nucleic acid or mutant -gene (e.g. , encoding a mutant 
polypeptide or protein), as defined herein, is readily distinguishable from a nucleic acid 

10 or gene encoding a protein homologue, as described above, in that a mutant nucleic acid 
or mutant gene encodes a protein or polypeptide having an altered activity, optionally 
observable as a different or distinct phenotype in a microorganism -expressing ^aid 
mutant gene or nucleic acid or producing said mutant protein or polypeptide {i.e. , a 
mutant microorganism) as compared to a corresponding microorganism expressing the 

1 5 wild-type gene or nucleic acid or producing said mutant protein or polypeptide. By ' 
contrast, a protein homologue has an identical or substantially similar activity, 
optionally phenotypically indiscernable when produced in a microorganism, as 
compared to a corresponding microorganism expressing the wild-type -gene or nucleic 
acid. Accordingly it is not, for example, the degree of sequence identity between 

20 nucleic acid molecules, genes, protein or polypeptides that serves to distinguish between 
homologues and mutants, rather it is the activity of the encoded protein or polypeptide 
that distinguishes between homologues and mutants: homologues having, for example, 
low (e.g., 30-50% sequence identity) sequence identity yet having substantially 
equivalent functional activities, and mutants, for example sharing 99% sequence identity 

25 yet having dramatically different or altered functional activities. Exemplary 

homologues are set forth in Figure 20 (i.e., CoaA homologues) and in Figure 23 (i.e., 
CoaX homologues). Exemplary mutants are described in Examples XV and XVIII 
herein. 

30 VIII. Recombinant Nucleic Acid Molecules and Vectors 

The present invention further features recombinant nucleic acid molecules (e.g., 
recombinant DNA molecules) that include nucleic acid molecules and/or genes 
described herein (e.g., isolated nucleic acid molecules and/or genes), preferably Bacillus 
genes, more preferably Bacillus subtilis genes, even more preferably Bacillus subtilis 

3 5 pantothenate kinase genes (e. g. , coaX genes or coaA genes), pantothenate biosynthetic 
genes (e.g. , genes encoding pantothenate biosynthetic enzymes, for example, panB 
genes encoding ketopantoate hydroxymethyltransferase,/7<arn£ genes encoding 
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ketopantoate reductase, panC genes encoding pantothenate synthetase, and/or panD 
genes encoding aspartate-a-decarboxylase) and/or isoleucine-valine (ilv) biosynthetic 
genes {e.g., ilvBNox alsS genes encoding acetohydroxyacid synthetase, ilvC genes 
encoding acetohydroxyacid isomeroreductase and/or ilvD genes encoding dihydroxyacid 
5 dehydratase). 

The present invention further features vectors (e.g. , recombinant vectors) that 
include nucleic acid molecules (e.g. , isolated or recombinant nucleic acid molecules 
and/or genes) described herein. In particular, recombinant vectors are featured that 
include nucleic acid sequences that encode bacterial gene products as described herein, 

1 0 preferably Bacillus gene products, more preferably Bacillus subtilis gene products, even 
more preferably Bacillus subtilis pantothenate biosynthetic gene products (e.g., 
pantothenate biosynthetic enzymes, for example, ketopantoate 
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase, and/or 
aspartate-a-decarboxylase) and/or isoleucine-valine biosynthetic gene products (e.g., 

15 , acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase and/or dihydroxyacid 
dehydratase). 

The term "recombinant nucleic acid molecule" includes a nucleic acid molecule 
(e.g. , a DNA molecule) that has been altered, modified or engineered such that it differs 
in nucleotide sequence from the native or natural nucleic acid molecule from which the 
20 recombinant nucleic acid molecule was derived (e.g. , by addition, deletion or 
substitution of one or more nucleotides). Preferably, a recombinant nucleic acid 
molecule (e.g., a recombinant DNA molecule) includes an isolated nucleic acid 
molecule or gene of the present invention (e.g., an isolated coaX, coaA, pan or ilv gene) 
operably linked to regulatory sequences. 

25 The term "recombinant vector" includes a vector (e.g. , plasmid, phage, phasmid, 

virus, cosmid or other purified nucleic acid vector) that has been altered, modified or 
engineered such that it contains greater, fewer or different nucleic acid sequences than 
those included in the native or natural nucleic acid molecule from which the 
recombinant vector was derived. Preferably, the recombinant vector includes a coaX, 

30 coaA, pan or ilv gene or recombinant nucleic acid molecule including such coaX, coaA, 
pan or ilv gene, operably linked to regulatory sequences, for example, promoter 
sequences, terminator sequences and/or artificial ribosome binding sites (RBSs), as 
defined herein. 

The phrase "operably linked to regulatory sequence(s)" means mat the 
35 nucleotide sequence of the nucleic acid molecule or gene of interest is linked to the 
regulatory sequence^) in a manner which allows for expression (e.g., enhanced, 
increased, constitutive, basal, attenuated, decreased or repressed expression) of the 
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nucleotide sequence, preferably expression of a gene product encoded by the nucleotide 
sequence (e.g., when the recombinant nucleic acid molecule is included in a 
recombinant vector, as defined herein, and is introduced into a microorganism). 

The term "regulatory sequence" includes nucleic acid sequences which affect 
5 (e.g., modulate or regulate) expression of other nucleic acid sequences. In one 

embodiment, a regulatory sequence is included in a recombinant nucleic acid molecule 
or recombinant vector in a similar or identical position and/or orientation relative to a 
particular gene of interest as is observed for the regulatory sequence and -gene of interest 
as it appears in nature, e.g., in a native position and/or orientation. For example, a«gene 
10 of interest can be included in a recombinant nucleic acid molecule or recombinant vector 
operably linked to a regulatory sequence which accompanies or is adjacent to the -gene 
of interest in the natural organism (e.g., operably linked to "native" regulatory 
sequences, for example, to the "native" promoter). Alternatively, a^ene of interest can 
be included in a recombinant nucleic acid molecule or recombinant vector operably 
1 5 linked to a regulatory sequence which accompanies or is adjacent to another <Je.g., a 

different) gene in the natural organism. Alternatively, a gene of interest can be included 
in a recombinant nucleic acid molecule or recombinant vector operably linked to a 
regulatory sequence from another organism, ^or -example, regulatory sequences from 
other microbes (e.g., other bacterial regulatory sequences, bacteriophage regulatory 
20 sequences and the like) can be operably linked to a particular^gene of interest. 

In one embodiment, a regulatory sequence is a non-native or non-naturaliy- 
occurring sequence (e.g., a sequence which has been modified, mutated, substituted, 
derivatized, deleted including sequences which are chemically synthesized). Preferred 
regulatory sequences include promoters, enhancers, termination signals, anti-termination 
25 signals and other expression control elements (e.g. , sequences to which repressors or 
inducers bind and/or binding sites for transcriptional and/or translational regulatory 
proteins, for example, in the transcribed mRNA). 'Such regulatory sequences are 
described, for example, in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular 
Cloning: A Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring 
30 Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. Regulatory sequences include 
those which direct constitutive expression of a nucleotide sequence in a microorganism 
(e.g., constitutive promoters and strong constitutive promoters), those which direct 
inducible expression of a nucleotide sequence in a microorganism (e.g., inducible 
promoters, for example, xylose inducible promoters) and those which attenuate or 
35 repress expression of a nucleotide sequence in a microorganism (e.g. , attenuation signals 
or repressor sequences). It is also within the scope of the present invention to regulate 
expression of a gene of interest by removing or deleting regulatory sequences. For 
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example, sequences involved in the negative regulation of transcription can be removed 
such that expression of a gene of interest is enhanced. 

In one embodiment, a recombinant nucleic acid molecule or recombinant vector 
of the present invention includes a nucleic acid sequence or gene that encodes at least 
5 ' one bacterial gene product (e.g. , a pantothenate biosynthetic enzyme, an isoleucine- 
valine biosynthetic enzyme, or a CoaA biosynthetic enzyme, for example CoaA or 
CoaX) operably linked to a promoter or promoter sequence. Preferred promoters of the 
present invention include Bacillus promoters and/or bacteriophage promoters (e.g. , 
bacteriophage which infect Bacillus). In one embodiment, a promoter is a Bacillus 
10 promoter, preferably a strong Bacillus promoter (e.g. , a promoter associated with a 
biochemical housekeeping gene in Bacillus or a promoter associated with a glycolytic 
pathway gene in Bacillus). In another embodiment, a promoter is a bacteriophage 
promoter. In a preferred embodiment, the promoter is from the bacteriophage SP01. In 
a particularly preferred embodiment, a promoter is selected from the group consisting of 
15 P /5 ,i> 2 ,orP wg ,forexample,mepromoterssetforminSEQIDNO:39,SEQIDNO:40 

or SEQ ID NO:41. Additional preferred promoters include te/(the translational 
elongation factor (TEF) promoter) mApyc (the pyruvate carboxylase (PYC) promoter), 
which promote high level expression in Bacillus (e.g., Bacillus subtilis). Additional ' 
preferred promoters, for example, for use in Gram positive microorganisms include, but 
20 are not limited to, the ^promoter or phage SP02 promoters. Additional preferred 
promoters, for example, for use in Gram negative microorganisms include, but are not 
limited to tac, trp, tet, trp-tet, lpp, lac, Ipp-lac, laclq, 77, T5, T3, gal, trc, ara, SP6, X-P R 
or X-Pl- 

In another embodiment, a recombinant nucleic acid molecule or recombinant 
25 vector of the present invention includes a terminator sequence or terminator sequences 
(e.g.,transcriptionterminatorsequences). The term 'terminator sequences" includes 
regulatory sequences which serve to terminate transcription of a gene. Terminator 
sequences (or tandem transcription terminators) can further serve to stabilize mRNA 
(e.g., by adding structure to mRNA), for example, against nucleases. 
30 ' m yetanomerembodiment,arecombinantnucleicacidmoleculeorrecombinant 

vector of the present invention includes sequences which allow for detection of the 
vector containing said sequences (i.e., detectable and/or selectable markers), for 
example, sequences that overcome auxotrophic mutations, for example, ura3 or ilvE, 
fluorescent markers, and/or colorimetric markers (e.g., / fl c2/p-galactosidase), and/or 
35 antibiotic resistance genes (e.g., amp or tet). 
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In yet another embodiment, a recombinant nucleic acid molecule or recombinant 
vector of the present invention includes an artificial ribosome binding site <RBS). The 
term "artificial ribosome binding site<RBS)" includes a site within an mRNA molecule 
{e.g., coded within DNA) to which a ribosome binds teg., to initiate translation) which 

5 differs from a native RBS {e.g. , a RBS found in a iiaturally<>ccurring gene) by at least 
one nucleotide. Preferred artificial RBSs include about 5-6, 7-8, 9-10, 1 1-12, 13-14, 15- 
16, 17-18, 19-20, 21-22, 23-24, 25-26, 27-28, 29-30 or more nucleotides of whkh about 
1-2, 3-4, 5-6, 7-8, 9-10, 11-12, 13-15 or more-differ from the native RBS {e.g., the 
native RBS of a gene of interest). Preferably, nucleotides which differ are substituted 

10 such that they are identical to one or more nucleotides of an ideal RBS {e.g. , SEQ ID 
N0.44, SEQ ID NO:45, SEQ ID N0.46, SEQ ID*IO:47 or SEQ ID NO:4S), when 
optimally aligned for comparisons. Artificial RBSs can be used to replace the naturally- 
occurring or native RBS associated with a particular gene. Artificial RBSs preferably 
■ increase translation of a particular gene. Preferred artificial RBSs (e.g., RBSs for 

1 5 increasing the translation of panB, for example, of B. subtilis panB) are depicted in 
Table 1A {e.g., SEQ ID NO:49 and SEQ ID NO:50). 

* i i ' 1 

Table I A: Preferred panB Ribosome Binding Sites 

20 io 20 

1 ) — 

AGAAAGGAGGTGA ideal RBS (SEQ ID NO: 44) 

CCCTCT - AG - AAGGAGGAGAAAACATG RBS1 {SEQ ID NO: 49) 

CCCTCT-AG- - AGGAGG AGAAAACATG RBS 2 <SEQ ID NO: SO) 

25 T AAACAT - G - - AGGAGGAGAAAACATG panB native RBS <SEQ ID NO: 42) 



30 



Additional preferred artificial RBSs (eg., RBSs for increasing the translation of 
panD, for example, of B. subtilis panD) are depicted in Table lB<e.g~, SEQ ID NO:51, 
SEQ ID NO:52, SEQ ID NO:53 and SEQ ID NO:54). 

Table IB: Preferred panD Ribosome Binding Sites 



10 



20 



35 CTAGAAAAGGAGGAATTTAAATG 
TTAAGAAAGGAGGTGANNNNATG 

TTAGAAAGGAGGATTTAAATATG 
TTAGAAAGGAGGTTTAATTAATG 
40 TTAGAAAGGAGGTGATTTAAATG 
TTAGAAAGGAGGTGTTTAAAATG 
TTAGAAAGGAGGTGANNNNNATG 



pAN423 RBS (SEQ ID NO: 88) 
ideal RBS <SEQ ID NO: 45) 

new design A (SEQ ID NO: 51) 
new design B <SEQ ID NO:S2) 
new design CI (SEQ ID NO:S3) 
new design C2 <SEQ ID NO: 54) 
ideal RBS (SEQ ID NO: 46) 
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Additional preferred artificial RBSs (e.g. , RBSs for increasing the translation of 
panD, for example, of B subtilis panD) are depicted in Table 1C (e.g., SEQ ID NO:55, 
SEQ ID NO:56 and SEQ ID NO:57). The predicted amino acid sequence at the C- , 
5 terminus of the PanC protein is shown. The start codon for PanD translation is 
underlined. 

Table 1C: Additional Preferred panD Ribosome Binding Sites 

10 20 

10 ] J 



15 



20 



--A GAA AGG AGG TGA NNN NNN N ATG ideal RBS (SEQ ID NO:47) 

ATT CGA GAA ATG GAG AGA ATA TAA T ATG native panD RBS (SEQ ID NO: 43) 

He Arg Glu Met Glu Arg He * Met SEQ ID NO: 89 

__ A GAA AGG AGG TGA NNN NNN N ATG ideal RBS (SEQ ID NO:47) 

ATT CGA GAA AGG AGG TGA ATA TAA T ATG NDI (SEQ ID NO: 55) 

He Arg Glu Arg Arg * Met SEQ ID NO: 90 

ATT CGA GAA AGG AGG TGA ATA ATA - ATG NDII (SEQ ID NO: 56) 

He Arg Glu Arg Arg * Met SEQ ID NO: 90 



ATT CGT AGA AAG GAG GTG AAT TAA T ATG NDIII (SEQ ID NO: 57) 
25 He Arg Arg Lys Glu Val Asn * Met SEQ ID NO: 91 

AGA AAG GAG GTG ANN NNN N ATG ideal RBS (SEQ ID NO:48) 



30 Accordingly, in one embodiment, a vector of the present invention includes an 

artificial RBS as set forth in SEQ ID NO:49 or SEQ ID NO.50. In another embodiment, 
a vector of the present invention includes an artificial RBS as set forth in SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53 or SEQ ID NO:54. In yet another embodiment, 
a vector of the present invention includes an artificial RBS as set forth in SEQ ID 

35 NO:55, SEQ ID NO:56 or SEQ ID NO:57. 

In another embodiment, a recombinant vector of the present invention includes 
sequences that enhance replication in bacteria (e.g., replication-enhancing sequences). 
In one embodiment, replication-enhancing sequences are derived from E. colu In 
another embodiment, replication-enhancing sequences are derived from pBR322 (e.g., 

40 sequences included within the pBR322 derived portion of any of the pAN vectors as set 
forth in the Figures , i.e., the Not I-Not I sequences from about 5.0 kB to 9.0 kB of the 
vector depicted in Figure 3 A). 
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In yet another embodiment, a recombinant vector of the present invention 
includes antibiotic resistance genes. Hie term "antibiotic resistance genes" includes 
sequences which promote or confer resistance to antibiotics on the host organism (e.g., 
Bacillus), In one embodiment, the antibiotic resistance genes are selected from the 

5 group consisting of cat (chloramphenicol resistance) genes, tet (tetracycline resistance) 
genes, erm (erythromycin resistance) genes, neo {neomycin resistance) genes and spec 
(spectinomycin resistance) genes. Recombinant vectors of the present invention can 
further include homologous recombination sequences (e.g., sequences designed to allow 
recombination of the gene of interest into the chromosome of the host organism). For 

10 example, amyE sequences can be used as homology targets for recombination into the 
host chromosome. 

Preferred vectors of the present invention include, but are not limited to, vectors 
set forth in Figures 2-15, 17, 19, 22, 25 and 26. It will further be appreciated by one of 
, skill in the art that the design of a vector can be tailored depending on such factors as the 
1 5 choice of microorganism to be genetically engineered, the level of -expression -of gene 
product desired and the like. 

IX. Isolated Proteins 
Another aspect of the present invention features isolated proteins {e.g. , isolated 

20 pantothenate biosynthetic enzymes and/or valine-isoleucine biosynthetic enzymes and/or 
isolated Co A biosynthetic enzymes, for example isolated CoaAt>rCoaX). In one 
embodiment, proteins (e.g., isolated pantothenate biosynthetic enzymes and/or valine- 
isoleucine biosynthetic enzymes and/or isolated CoaA biosynthetic enzymes, for 
example isolated CoaA or CoaX) are produced by recombinant DN A techniques and can 

25 be isolated from microorganisms of the present invention by an appropriate purification 
scheme using standard protein purification techniques. In another embodiment, proteins 
are synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein {e.g., an isolated or purified biosynthetic 
enzyme) is substantially free of cellular material or other contaminating proteins from 

30 the microorganism from which the protein is derived, or substantially free frrom 
chemical precursors or other chemicals when chemically synthesized. In one 
embodiment, an isolated or purified protein has less than about 30% (by dry weight) of 
contaminating protein or chemicals, more preferably less than about 20% of 
contaminating protein or chemicals, still more preferably less than about 10% of 

35 contaminating protein or chemicals, and most preferably less than about '5% 
contaminating protein or chemicals. 
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In a preferred embodiment, the protein or gene product is derived from Bacillus 
(e.g., is Bacftfws-derived). The term "derived from Bacillus 9 ' or "Bacitfus-derived" 
includes a protein or gene product which is encoded by a Bacillus gene. Preferably, the 
gene product is derived from a microorganism selected from the group consisting of 
5 Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, Bacillus firmus, Bacillus 

pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus circulars, Bacillus 
coagulans, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus 
thuringiensis, and other Group 1 Bacillus species, for example, as characterized by 16S 
rRNA type (Priest, supra). In another preferred embodiment, the protein or gene 
10 product is derived from Bacillus brevis or Bacillus stearothermophilus. In another 
preferred embodiment, the protein or gene product is derived from a microorganism 
selected from the group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens, 
Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus. In a particularly preferred 
embodiment, the protein or gene product is derived from Bacillus subtilis {e.g., is 
1 5 Bacillus subrtlis-denved). The term "derived from Bacillus subtilis" or "Bacillus 
sw&ri/is-derived" includes a protein or gene product which is encoded by a Bacillus 
subtilis gene. In yet another preferred embodiment, the protein or gene product is 
encoded by a Bacillus gene homologue (e.g., a gene derived from a species distinct from 
Bacillus but having significant homology to a Bacillus gene of the present invention, for 
20 example, a Bacillus pan gene or Bacillus ilv gene). 

Included within the scope of the present invention are bacterial-derived proteins 
or gene products and/or Ztac?7/ws-derived proteins or gene products {e.g., B. subtilis- 
derived gene products) that are encoded by naturally-occurring bacterial and/or Bacillus 
genes (e.g., B. subtilis genes), for example, the genes identified by the present inventors, 
25 for example, Bacillus or B. subtilis coaX genes, coaA genes, pan genes and/or ilv genes. 
Further included within the scope of the present invention are bacterial-derived proteins 
or gene products and/or Bacillus-derived proteins or gene products (e.g., B. subtilis- 
derived gene products) that are encoded bacterial and/or Bacillus genes (e.g., B. subtilis 
genes) which differ from naturally-occurring bacterial and/or Bacillus genes {e.g., B. 
30 subtilis genes), for example, genes which have nucleic acids that are mutated, inserted or 
deleted, but which encode proteins substantially similar to the naturally-occurring gene 
products of the present invention. For example, it is well understood that one of skill in 
the art can mutate (e.g., substitute) nucleic acids which, due to the degeneracy of the 
genetic eode, encode for an identical amino acid as that encoded by the naturally- 
35 occurring gene. Moreover, it is well understood that one of skill in the art can mutate 
(e.g., substitute) nucleic acids which encode for conservative amino acid substitutions. 
It is further well understood that one of skill in the art can substitute, add or delete 
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amino acids to a certain degree without substantially affecting the function of a gene 
product as compared with a naturally-occurring gene product, each instance of which is 
intended to be included within the scope of the present invention. 

In a preferred embodiment, an isolated protein of the present invention (e. g. , an 
5 isolated pantothenate biosynthetic enzyme and/or an isolated isoleuciqe-valine 
biosynthetic enzyme and/or an isolated CoaA biosynthetic enzymes, for -example 
isolated CoaA or CoaX) has an amino acid sequence shown in SEQ ID NO:3, SEQ ID 
NO:9, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 
NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID NO:87. In other 
10 embodiments, an isolated protein of the present invention is a homologue of the at least 
one of the proteins set forth as SEQ ID NO:3, SEQ ID NO:9, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID 
NO:36, SEQ ID NO:38 or SEQ ID NO:87 (e.g., comprises an pmino acid sequence at 
least about 30-40% identical, preferably about 40-50% identical, more preferably about 
1 5 50-60% identical, and even more preferably about -60-70%, 70-80%, 80-90%, 90-95% or 
more identical to the amino acid sequence of SEQ ID NO:3, SEQ ID NO:9, "SEQ ID 
NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID NO:87, and has an activity that is 
substantially similar to that of the protein encoded by the amino acid sequence of SEQ 
20 ID NO:3, SEQ ID NO:9, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID 
NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID 
NO:87, respectively. 

To determine the percent homology of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimateomparison purposes (e.g., gaps <:an 
25 be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). When a position in the first 
sequence is occupied by the same amino acid residue or nucleotide as the corresponding 
position in the second sequence, then the molecules are identical at that position. The 
percent identity between the two sequences is a function of the number of identical 
30 positions shared by the sequences (i.e., % identity = # of identical positions/total # of 
positions x 100), preferably taking into account the number of gaps and size of said gaps 
necessary to produce an optimal alignment. 

The comparison of sequences and determination of percent homology between 
two sequences can be accomplished using a mathematical algorithm. A preferred, non- 
35 limiting example of a mathematical algorithm utilized for the comparison of sequences 
is the algorithm of Karlin and Altschul (1990) Proc. Natl Acad. Sci. USA 87:2264-68, 
modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such 
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an algorithm is incorporated into the NBLAST and XBLAST programs (version, 2.0) of 
Altschul et al (1990) J. Mol Biol 215:403-10. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequenced homologous to nucleic acid molecules of the invention. BLAST 
5 protein searches can be performed with the XBLAST program, score = 50, wordlength = 
3 to obtain amino acid sequences homologous to protein molecules of the invention. To 
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as 
described in Altschul et al (1997) Nucleic Acids Research 25(17):3389-3402. When 
utilizing BLAST and Gapped BLAST programs, the default parameters of the respective 
10 programs (e.g., XBLAST and NBLAST) can be used. See http://www.ncbi.nlm.nih.gov. 
Another preferred, non-limiting example of a mathematical algorithm utilized for the 
comparison of sequences is the algorithm of Myers and Miller (1988) Comput Appl 
Biosci. 4:1 1-17. Such an algorithm is incorporated into the ALIGN program available, 
for example, at the GENESTREAM network server, IGH Montpellier, FRANCE 
1 5 (http://vega.igh.cnrs.fr) or at the ISREC server (http://www.ch.embnet.org). When 
utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

In another preferred embodiment, the percent homology between two amino acid 
sequences can be determined using the GAP program in the GCG software package 
20 (available at http://www.gcg.com), using either a Blossom 62 matrix or a PAM250 
matrix, and a gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. In yet 
another preferred embodiment, the percent homology between two nucleic acid 
sequences can be accomplished using the GAP program in the GCG software package 
(available at http://www.gcg.com), using a gap weight of 50 and a length weight of 3. 

25 

X. Biotransformatiom and Biocorrversions 

Another aspect of the present invention includes biotransformation processes 
which feature recombinant microorganisms (e.g., mutant microorganisms) and/or 
isolated CoA, pantothenate or isoleucine-valine biosynthetic enzymes described herein. 

30 The term "biotransformation process", also referred to herein as "bioconversion 
processes", includes biological processes which result in the production (e.g., 
transformation or conversion) of any compound (e.g., intermediate or product) which is 
upstream of a CoA, pantothenate or isoleucine-valine biosynthetic enzyme to a 
compound (e.g. , substrate, intermediate or product) which is downstream of said CoA, 

35 pantothenate or isoleucine-valine biosynthetic enzyme. 
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In one embodiment, the invention features a biotransformation process for the 
production of a panto-compound comprising contacting a microorganism which 
overexpresses at least one pantothenate biosynthetic -enzyme with at least one 
appropriate substrate or precursor under conditions such that said panto-compound is 
5 produced and recovering said panto-compound.. In a preferred embpdiment, the 

invention features a biotransformation process for the production of pantoate comprising 
contacting a microorganism which over-expresses ketopantoate reductase {the pan£ gene 
product) with an appropriate substrate<e.g., ketopantoate) under conditions such that 
pantoate is produced and recovering said pantoate. In another preferred -embodiment, 
10 the invention features a biotransformation process for the production of pantothenate 
comprising contacting a microorganism which overexpresses ketopantoate reductase and 
pantothenate synthetase with appropriate-substrates<e.g., ketopantoate and p-alanine) 
under conditions such that pantothenate is produced and recovering said pantothenate. 
. In yet another preferred embodiment, the invention features a biotransformation process 
1 5 for the production of pantothenate comprising contacting a microorganism which 
overexpresses ketopantoate hydroxymethyltaransferase, ketopantoate reductase and 
pantothenate synthetase with appropriate substrates (e.g. , a-ketoisovalerate and £- 
alanine) under conditions such that pantothenate is produced and recovering said 
pantothenate. Preferred recombinant microorganisms for carrying out the above- 
20 described biotransformations include pantothenate kinase mutants. Conditions under 
which pantoate or pantothenate arc produced can include any conditions which result in 
the desired production of pantoate or pantothenate, respectively. 

In yet another embodiment, the present invention includes a method of 
producing p-alanine that includes culturing a microorganism which overexpresses 
25 aspartate-a-decarboxyiase under conditions such that ^-alanine is produced. Preferably, 
the aspartate-a-decarboxylase-overexpiessing microorganism has a mutation in a nucleic 
acid sequence encoding a pantothenate biosynthetic enzyme selected from the group 
consisting of ketopantoate hydroxymethyltransferase, ketopantoate reductase and 
pantothenate synthetase. 
30 The invention further features a method of producing ^-alanine that includes 

contacting a composition comprising aspartate with an isolated Bacillus aspartate-a- 
decarboxylase enzyme under conditions such that p-alanine is produced (e.g. , an in vitro 
synthesis method). 

The microorganism(s) and/or -enzymes used in the biotransformation reactions 
35 are in a form allowing them to perform their intended function^. g. , producing a desired 
compound). The microorganisms can be whole cells, orcan be only those portions of 
the cells necessary to obtain the desired end result The microorganisms can be 
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suspended (e.g., in an appropriate solution such as buffered solutions or media), rinsed 
(e.g., rinsed free of media from culturing the microorganism), acetone-dried, 
immobilized (e.g., with polyacrylamide gel or k-carrageenan or on synthetic supports, 
for example, beads, matrices and the like), fixed, cross-linked or permeablized (e.g.,, 
5 have permeablized membranes and/or walls such that compounds, for example, 
substrates, intermediates or products can more easily pass through said membrane or 
wall). 

Purified or unpurified CoA biosynthetic enzyme(s) (e.g., CoaA and/or CoaX), 
pantothenate biosynthetic enzyme(s) and/or valine-isoleucine biosynthetic enzyme(s) 
1 0 can also be used in biotransformation reactions. The en2yme can be in a form that 
allows it to perform its intended function {e.g., obtaining the desired compound). For 
example, the enzyme can be in free form or immobilized. Purified or unpurified CoA 
biosynthetic enzyme(s), pantothenate biosynthetic enzyme(s) and/or valine-isoleucine 
biosynthetic enzyme(s) can be contacted in one or more in vitro reactions with 
1 5 appropriate substrate(s) such that the desired product is produced. 

With respect to at least the above-described methodologies (e.g., the production 
methodologies of the present invention), at least one aspect of the invention features the 
folowing: embodiments is which the methods do not use microorganisms of the genus 
Corynebacterium and/or microorganisms of the genus Escherichia-, embodiments in 
20 which the methods do not use microorganisms selected from the group consiting of 

Escherichia coli and Corynebacterium glutamicum; embodiments in which the methods 
do not use gram negative microorganisms; embodiments in which the microorganisms 
utilized do not include, express or produce nucleic acid molecules, genes or proteins 
(e.g., biosynthetic emzymes) derived from microorganisms of the genus 
25 Corynebacterium and/or microorganisms of the genus Escherichia; embodiments in 
which the microorganisms to not include, express or produce nucleic acid molecules, 
genes or proteins (e.g., biosynthetic emzymes) derived from microorganisms selected 
from the group consisting of Escherichia coli and Corynebacterium glutamicum. 

30 XI. Screening Assays 

Because CoA is an essential factor in bacteria, proteins (e.g. , enzymes) involved 
in the biosynthesis of CoA provide valuable tools in the search for novel anti-biotics. In 
particular, the CoaX protein is a valuable target for identifying bacteriocidal compounds 
because it bears no resemblance in primary sequence to mammalian pantothenate kinase 

35 enzymes. Accordingly, the present invention also provides a method (also referred to 
herein as a "screening assay") for identifying modulators, i.e., candidate or test 
compounds or agents {e.g., peptides, peptidomimetics, small molecules or other drugs) 
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which bind to CoaX, or have a stimulatory or inhibitory effect on, for example, <oaX 
expression or CoaX activity. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are capable of binding to CoaX proteins or a biologically active 

5 portion thereof. In another embodiment, the invention provides assays for screening 
candidate or test compounds which modulate the activity of CoaX proteins or 
biologically active portions thereof. The test compounds of the present invention can be 
obtained using any of the numerous approaches in combinatorial library methods known 
in the art, including: biological libraries; spatially addressable parallel solid phase or, 

10 solution phase libraries; synthetic library methods requiring deconvolution; the 'one- 
bead one-compound' library method; and synthetic library methods using affinity 
chromatography selection. The biological library approach is limited to peptide 
libraries, while the other four approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds {Lam, K.S. (1997) Anticancer Drug 

15 Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al (1993) Proc. Natl Acad Sci., US. A. 90:6909, Erb et 
al (1994) Proc. Natl Acad.Sci. USA 91:1 1422;Zuckermann etal (1994). J. Med 
Chem. 37:2678; Cho et al (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. 

20 Int. Ed: Engl 33:2059; Carell etal (1994) Angew. Chem Int. Ed Engl 33:2061; and in 
Gallop etal (1994) J. Med. Chem. 37:1233. Libraries of compounds may be presented 
in solution (e.g., Houghten<1992) Bibtechniques 13:412-421), or on beads {Lam (1991) 
Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria {Ladner USP 
5,223,409), spores (Ladner USP '409), plasmids<Cull et al (1992) Proc Natl AcadSci 

25 USA 89:1865-1869) or on phage{Scott and Smith (1990) Science 249:386-390);{Devlin 
(1990) Science 249:404-406); (Cwirla et al <1990) Proc. Natl Acad. Scl 87:6378- 
6382); (Felici (1991) J. Mol Biol 222:301-310); (Ladner supra.). 

In one embodiment, an assay is a microorganism-based assay in which a 
recombinant microorganism which expresses a CoaX protein or biologically active 

30 portion thereof is contacted with a test compound and the ability of the test compound to 
modulate CoaX activity is determined. Determining the ability of the test compound to 
modulate CoaX activity can be accomplished by monitoring, for example, intracellular 
phosphopanthoate or Co A concentrations or secreted pantothenate concentrations (as 
compounds that inhibit CoaX will result in a buildup of pantothenate in the test 

35 microorganism). CoaX substrate can be labeled with a radioisotope or enzymatic label 
such that modulation of CoaX activity can be determined by detecting a conversion of 
labeled substrate to intermediate or product For example, CoaX substrates can be 
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labeled with 32p ? 14^ or 3jj, either directly or indirectly, and the radioisotope detected 
by direct counting of radioemmission or by scintillation counting. Determining the 
ability of a compound to modulate CoaX activity can alternatively be determined by 
detecting the induction of a reporter gene (comprising a CoA-responsive regulatory 

5 element operatively linked to a nucleic acid encoding a detectable marker, e.g. , 
luciferase), or detecting a CoA-regtilated cellular response. 

In yet another embodiment, a screening assay of the present invention is a cell- 
free assay in which a CoaX protein or biologically active portion thereof is contacted 
with a test compound in vitro and the ability of the test compound to bind to or modulate 

10 the activity of the CoaX protein or biologically active portion thereof is determined. In 
a preferred embodiment, the assay includes contacting the CoaX protein or biologically 
active portion thereof with known substrates to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to 
modulate enzymatic activity of the CoaX on its substrates. 

1 5 Screening assays can be accomplished in any vessel suitable for containing the 

microorganisms, proteins, and/or reactants. Examples of such vessels include microtiter 
plates, test tubes, and micro-centrifiige tubes. In more than one embodiment of the 
above assay methods of the present invention, it may be desirable to immobilize either . 
CoaX protein or a recombinant microorganism expressing CoaX protein to facilitate 

20 separation of products and/or substrates, as well as to accommodate automation of the 
assay. For example, glutathione-S-transferase/CoaX fusion proteins can be adsorbed 
onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
derivatized microtiter plates. Other techniques for immobilizing proteins on matrices 
(e.g., biotin-conjugation and streptavidin immobilization or antibody conjugation) can 

25 also be used in the screening assays of the invention. 

In another embodiment, modulators of CoaX expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of 
coaXmRNA or CoaX polypeptide in the cell is determined. The level of expression in 
the presence of the candidate compound is compared to the level of expression in the 

30 absence of the candidate compound (or to a suitable control, for example, an appropriate 
buffer control or standard). The candidate compound can then be identified as a 
modulator of coaXmRNA or CoaX polypeptide expression based on this comparison. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 

35 agent identified as described herein in an appropriate animal model. For example, an 
CoaX modulating agent identified as described herein (e.g., an anti-bactericidal 
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compound) can be used in an infectious animal model to determine the efficacy, 
toxicity, or side effects of treatment with such an agent. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents, patent applications 

5 {including U.S. Patent Application Serial No. 09/400,494, filed September 21, 1999 
(pending), provisional U.S. Patent Application Serial No. 60/210,072, filed June 7, 
2000, provisional U.S. Patent Application Serial No. 60/221,938, filed July 28, 2000 and 
provisional U.S. Patent Application Serial No. 60/227,860, filed August 24, 2000, to 
which this application relates) and published patent applications cited throughout this 

1 0 application are incorporated herein by reference. 



BNSOOClD:<WO 0121772A3 IA> 



WO 01/021772 



-59- 



PCT/US00/25993 



EXAMPLES 

i 

General Methodology: 

Strains. Bacillus subtilis strains of the present invention are generally derived 
' fiom either of two strains. The first is variously named "168", "1A1", or "RL-1". The • 
5 genotype is trpC2. This strain was derived fiom the wild type "Marburg" strain by 
mutagenesis and has been the basis of much of the molecular biology work done on B. 
subtilis. The second strain is PY79, a prototrophic derivative of 168 that was made Trp + 
by transduction fiom the wild type strain W23. 

Media. Standard minimal medium for B. subtilis is comprised of 1 x Spizizen 
1 0 salts and 0.5% glucose. Standard solid "rich medium" is Tryptone Blood Agar Broth 
(Difco), and standard liquid "rich medium" is VY, a mixture of veal infusion broth and 
yeast extract. For testing production of pantothenate in liquid test tube cultures, an 
enriched form of VY, called "Special VY' or "SVY" is used. For batch fermentations, 
' SVYG and PFMG are used. The compositions of these media are given below. 

15 VY, a rich liquid medium: 25 g Difco Veal Infusion Broth, 5 g Difco Yeast 

Extract, 1L water (autoclave). 

TBAB, a rich solid medium: 33 g Difco Tryptone Blood Agar Broth, 1L water 
(autoclave). 

MIN, a minimal medium: 100 ml 10 x Spizizen salts; 10 ml 50% glucose; 2 ml 
20 10% arginine HC1*; 10 ml 0.8% tryptophan**; water to 1 liter. (*In some cases, 
arginine is omitted or replace by sodium glutamate at 0.04% final concentration. In 
general, B. subtilis grows faster in minimal medium when certain amino acids, such as 
arginine, glutamine, glutamate, or proline, are added as an auxiliary nitrogen source. ) 
(**For strains that are tryptophan autotrophs, tryptophan is routinely added to most 
25 minimal media.) 

10 x Spizizen Salts: 174 g K 2 HP0 4 -3H,0; 20 g (NH^SO,; 60 g KH 2 PO<; 10 g 
Na 3 Citrate-2H 2 0; 2 g MgS0 4 -7H 2 0; water to 993 mis; then add 3.5 ml FeCl 3 solution and 
3.5 ml Trace Elements solution. 

FeClsSolution: 4 g FeCl 3 -6H 2 0; 197 g Na,Citrate-2H 2 0; water to 1 liter (filter 
30 sterilize) 

Trace Elements Solution: 0.15 g Na,Mo0 4 -2H 2 0; 2.5 g H 3 B0 3 ; 0.7 g 
CoCl 2 -6H 2 0; 0.25 g CuS0 4 -5H 2 0; 1.6 g MnCl 2 -4H 2 0; 0.3 g ZnS0 4 -7H 2 0; water to 1 liter 
(filter sterilize). 
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• 

SVY, Special VY, a supplemented* rich medium for testing pantothenate 
production in test tube cultures: 25 g Difco Veal Infusion Broth; 5 g Difco yeast 
extract; 5 g sodium glutamate; 2.7 g ammonium sulfate; 740 ml water {autoclave); add 
200 ml 1 M potassium phosphate, pH 7.0; 60 ml 50% glucose. (*For testing 
5 pantothenate production in liquid S VY test tube cultures, Na a-ketoisovalerate and/or {J- 
alanine can be added to a concentration of 5 -g/L from filter-sterilized stocks.) 

PFMG, a yeast extract based medium used in fermentors: 20 g Amberex 
1 003™ yeast extract; 5 g sodium glutamate, 2 g ammonium sulfate; 5 g tryptophan; 10 g 
KH 2 P04; 20 g K 2 HP0 4 -3H 2 0; 1 g MgCl 2 -6H 2 0; 0.1 g CaCl 2 -2H 2 0; 1 g sodium citrate; 
10 0.01 g FeS(V7H 2 0; 1 ml trace elements solution; 20 g glucose; add water to 1 L. 

Glucose or other sugars are fed as needed. Feed solutions can contain minerals, defined 
or food grade nutrients. 

PF t a chemically defined pantothenate free medium for testing pantothenate 
auxotrophy: 100 ml 10 x Spizizen Salts; 100 ml 1 x Difco Pantothenate Assay Medium; 
15 10 ml 50% glucose; water to 1 liter. 

For pantothenate auxotrophs, 1 mM Na pantothenate is added to both minimal 
and rich media, since there is generally not enough pantothenate in rich media to support 
B. subtilis pan mutants. Amino acids are at 100 mg per liter, when used. 
Selection for antibiotic resistance is done with 5 mg/L chloramphenicol, 100 mg/L 
20 spectinomycin HC1, 15 mg/L tetracycline HC1, or 1 mg/L erythromycin plus 25 mg/L 
lincomycin. 

Pantothenate Assays: Biological assay. The indicator organism, Lactobacillus 
plantarum, requires pantothenate for growth, and responds to low concentrations (\xg/L). 

25 Thus, using serial dilutions, a wide range of concentrations can be assayed. 

Commercially available medium (e.g., Pantothenate Assay Medium (PAM), Di&o),-can 
be used. However, Difco PAM supplemented with pantothenate does not support 
growth to the same level as obtainable using a fresh-mixed version of Pantothenate 
Assay Medium (FM-PAM), made up of the individual -components as specified by 

30 Difco, which is accordingly, routinely used instead of the commercial product. 

Before assaying B. subtilis culture supernatants, the B. subtilis veils must be 
either removed or killed. B. subtilis culture supernatants give approximately the same 
pantothenate titer when the supernatants are autoclaved as when they are sterile filtered. 
Accordingly, routine procedures involve autoclaving samples for 5 minutes prior to the 

35 biological assay. 
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Pantothenate Assays: HPLC assay. Pantothenic acid production is measured 
by HPLC with a detector wavelength of 197 nm and a reference at 450 nm. The 
procedure is a modification of one recommended by Hewlett-Packard for water soluble 
vitamins. Samples of culture broth are diluted into an equal volume of 60% acetronitrile 
5 ' (ACN), centrifuged and filtered. Typically a further 1 0-fold dilution before analysis 
brings the final dilution to 20-fold. Higher concentrations of product are diluted further. 
Compounds are separated on a CI 8 Phenomenex 5u Aqua 250 x 4.6 mm column with 
5% acetronitrile (ACN) in 50 mM Na phosphate buffer at pH 2.5. An ACN gradient 
from 5% to 95% washes the column between every sample. The area of the 
10 pantothenate peak is proportional to the concentration between 5 to 1000 mg/L. Other 
panto-compounds are also separated and quantitated by this method. 

Amino Acid Analysis: HPLC assay. Amino acids present in the fermentation 
medium and throughout the fermentation are measured by HPLC with a detector 
wavelength of 338 nm and a reference at 390 nm. The procedure is a modification of 
15 one recommended by Hewlett-Packard for amino acid analysis. Samples of culture 
broth are prepared identically as for the panto-compound analysis. Compounds are 
separated on a C18 Hypersil 5u ODS 200 x 2.1 mm column. Solvent A is 20 mM Na 
acetate buffer at pH 7.2. Solvent B contains 40% ACN and 40% methanol. A gradient 
from 100% Solvent A to 100% Solvent B separates amino acids and washes the column 
20 between every sample. 

Batch Fermentations. Pantothenate producing strains are grown in stirred tank 
fermentors, for example, in CF3000 Chemap 14 liter vessels with 10 liter working 
volumes. Computer control and data collection is by commercial software, for example, 
B. Braun Biotech MFCS software. Fermentations can be batch processes but are 
25 preferably sugar-limited, fed batch processes. Some media components {e.g. of SVYG 
and PMFG) are added to the fermentor and sterilized in place. Portions of the media are 
sterilized separately and added to the fermentors aseptically. This procedure is well 
known to those familiar with the art Additional nitrogen sources in feeds are sterilized 
separately and added to the carbon source after cooling. 
30 The initial sugar in the medium is consumed in approximately 6 hours. 

Afterwards, glucose or other sugars are fed with the possible addition of minerals, and 
defined or food grade nutrients. Alternatively, feeds are scheduled based on a consensus 
profile of nutritional requirements from samples taken from earlier fermentations. 
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After inoculation, agitation is set at a relatively low speed, e.g. 200 rpm. When 
the dissolved oxygen <p02) falls to 30%, computer control automatically adjusts the 
agitation to maintain a dissolved oxygen concentration between 25 and 30% p02. 

5 EXAMPLE I: Enhanced Production of a Panto-Gompound Using Bacteria 
Overexpressing panBCD Gene Products. 

This Example describes the cloning of the B. subtilis panBCD operon and the 
generation of microorganisms overexpressing the panBCD gene products. 

To clone the B. subtilis panBCD operon, a plasmid library of B. subtilis GP275 
10 (a derivative of 168) genomic DNA was transformed in E. coli BM4062 (birA*), and 
temperature resistant clones were selected at 42°C. By comparison of restriction maps 
to the genome sequence, one particular clone was deduced to contain the B. subtilis birA 
gene and the adjacent panBCD genes. This plasmid was named pAN201 . 

To overexpress the panBCD operon and produce pantothenate, the native 
1 5 promoter of the panBCD operon was replaced by -either of two strong, constitutive 
promoters derived from the B. subtilis bacteriophage SP01. These two promoters are 
named P 36 and P 15 . In addition, either of two artificial ribosome binding sites (RBSs) 
were used to replace the native panB RBS. These two artificial RBSs (set forth as'SEQ 
ID NO:49 and SEQ ID NO:50) were predicted to increase translation of panBCD; their 
20 sequences are shown in Table 1 A. Three such -engineered panBCD expression cassettes 
were built into circular plasmids capable of replicating in E. coli. Other features of the 
plasmids include a strong rho-independent transcription terminator from the E. coli 
ribosomal RNA transcription unit, called TiT 2 , a Gram-positive chloramphenicol 
resistance gene (cat), derived from pC194, and a pair of Wort restriction sites at the 
25 junctions between the E. coli replicon and the segment intended for integration into B. 
subtilis. Three plasmids of this series, pAN004, pAN005, and pAN006 were 
constructed. pAN004 contains the P 2 6 promoter, RBS1 , and a iowtx>py E. coli replicon. 
pANOOS contains the P/5 promoter, which in our-experience is not as strong as P 2 e, 
RBS1 , and the low copy replicon. pAN006 contains the Pie promoter, RBS2, and a 
30 medium copy replicon. 

The three panBCD expression -cassettes contained in the above-mentioned three 
plasmids weie all ligated to a DNA fragment consisting of sequences that naturally 
occur immediately upstream from the native panB gene and integrated in single copy by 
homologous recombination into the panBCD locus of B. subtilis strains RL-1 and PY79, 
35 replacing the wild-type operon. This was accomplished in two steps. First a deletion- 
substitution that replaced about two thirds of the panB xxxiing region with a Gram- 
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positive spectinomycin resistance gene (spec) was integrated at panB to yield Spec', 
pantothenate auxotrophs. These intermediate strains were than transformed with the 
pcmBCD expression cassettes of pAN004, pAN005, and pAN006 after ligating them to a 
DNA fragment containing chromosomal sequences just upstream of panB. Selection of 

5 the incoming cassette was for pantothenate prototrophy. The resulting strains were 
named PA221 , PA222 and PA223 (from RL-1), and PA235, PA232and PA233 (from 
PY79), respectively. An example of a plasmid that contains the joined upstream 
sequence that is in the integrated strain in PA221 is pAN240 (see Figure 2). The 
nucleotide sequence of pAN240 is set forth as SEQ ID NO:76. 

1 0 Polymerase chain reaction using appropriate primers was used to verify the 

correct chromosomal structures of these engineered strains. When extracts of strain 
PA221 were examined by SDS-PAGE, two proteins were found to be overexpressed. 
One protein had an apparent molecular weight of 29,000 and the other protein appeared 
to be 39,000 daltons. The 29,000 dalton bands is presumably PanB (predicted molecular 

15 weight of 29,761). The larger protein band presumably represents PanC (predicted size 
3 1,960 daltons). 

The ability of these strains to produce pantothenate in test tube cultures was 
assesed as follows. Each strain was grown in SVY medium supplemented with 5 g/L a- 
ketoisovalerate (a-KTV) and 5 g/L p-alanine, to ensure that these precursors were not 
20 limiting. Culture supernatants were autoclaved and assayed using the bioassay. 

Relative to the parent strains, RL-1 and PY79, the engineered strains produced about 8- 
to 30-fold more pantothenate, attaining 1 g/L pantothenate in some cases. 
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Table 2. Production of pantothenate by engineered B, subtilis strains in liquid test 
tube cultures grown in SVY medium with 5 g/L a-KIVand 5 g/L <f3- 
alanine. 
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5 The Pie promoter was about 3- to 4-fokl more effective than the P iS promoter, 

while RBS1 and RBS2 were roughly equivalent Plasmids such as pAN004 5 pAN005, 
pAN006 can also be recombined as circles into the B. subtilis wild type panBCD locus 
by Campbell-type {single crossover) integration, selecting for chloramphenicol 
resistance at 5 mg/L. Strains obtained in this fashion produce about the same amount of 

10 pantothenate as strains PA221 , PA222, and PA223, respectively. pANOM containing 
the F26 promoter, RBS1 and a low copy E. coli replicon, is depicted schematically in 
Figure 3 A. The nucleotide sequence of plasmid pAN004 is set forth as SEQ ID NO:93. 
pAN006 containing the P 2 e promoter, RBS2 and a medium copy E. coli replicon, is 
depicted schematically in Figure 3B. The nucleotide sequence of plasmid pAN006 is set 

1 5 forth as SEQ ED NO:94. The nucleotide sequence of panBCD is set forth as SEQ ID 
NO:59 and the predicted amino acid sequences of PanB, PanC and PanD are set forth as 
SEQ ID NO:24, SEQ ID NO:26 and SEQ ID NO:28, respectively. Methods for 
manipulating Bacilli are described, for example, in Harwood, C.R. and Cutting, S.M. 
(editors), Molecular Biological Methods for Bacillus (1990) John Wiley & Sons, Ltd., 

20 Chichester, England, the content of which is incorporated herein by reference. 



Ot?l77?A3 IA> 



WO 01/021772 PCT/US00/25993 

-65- 

EXAMPLE II: Enhanced Production of a Panto-Compound Using Bacteria 
Overexpressing the panEl Gene Product - Ketopantqate Reductase. 

This Example describes the cloning of the B. subtilis panEl gene and the 
generation of microorganisms overexpressing the panEl gene product. 

5 Pan" B. subtilis strains {e.g. , B. subtilis mutants blocked in the synthesis of 

pantothenic acid) had previously been isolated, one of which was reported to be affected 
in ketopantoate reductase activity (Baigori et al (1991) 1 Bacterid 173:4240-4242). 
However, the mutations in these strains were incorrectly mapped to ihepurE-tre interval 
of the B. subtilis genetic map which does not contain the panE oxpanBCD genes. 
1 0 Furthermore as shown below, a panE mutant does not have a Pan" phenotype as the ilvC 
gene product can substitute for the panE gene product in B, subtilis as in other bacterial 
strains such as E. colt More recently, the S. typhimuruim panE gene has been located 
and determined to be allelic to apbA, a gene required for anaerobic purine biosynthesis 
(Frodymae/a/. (1998) J. Biol Chem. 273:5572-5576). E. coli carries a highly 
1 5 homologous gene at the same map location. Identification of the panE genes in E. coli 
and S. typhimurium was complicated by the fact that the ilvC gene product, 
acetohydroxy acid isomeroreductase, is also capable of carrying out the ketopantoate 
reductase reaction. As a result, pantothenate auxotrophy is not obtained unless both 
panE and ilvC are mutated. 
20 To identify the B. subtilis panEl gene, the B. subtilis genome was searched using 

the protein sequence of £ coli or & typhimurium ApbA (PanE), and two open reading 
frames were identified having homology to ApbA, named ylbQ wAykpB. These genes 
were renamed panEl and panE2, due to their proposed function in pantothenate i 
biosynthesis. Both panEl and panE2 were cloned as PCR products generated from 
25 RL-1 genomic DNA as a template. Both genes were disrupted by either a 

spectinomycin resistance gene {spec) or a chloramphenicol resistance gene {cat). The 
interrupted genes were each integrated by double crossover into PY79 to give PA240 
{ApanEl::spec) and PA241 {ApanE2::cat). Neither of these strains were pantothenate 
auxotrophs when tested on pantothenate-free (PF) plates, although PA240 containing 
30 ApanEl::spec grew slightly more slowly on TBAB without added pantothenate than 
with a 1 mM pantothenate supplement. By comparison, a bpanBwspec strain does not 
produce single colonies on TBAB, presumably because B. subtilis has no active uptake 
system for pantothenate. 

It was hypothesized that the A subtilis gene, z/vC, could function for panE as had 
35 been shown for E. colu Accordingly, the panEl and panE2 disruptions were introduced 
into a strain, CU550, which is reported to be trpC2 ilvC4 leuC124. Both the single 
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panEl and the double panEl, panE2 disruptants were pantothenate auxotrophs on PF 
medium. 



Table 3. Phenotypes of various panEl and panE2 mutants on rich and defined 



media. 






Growth*: 




Strain 


Medium 


-pan 


+ pan 


PY79 


TBAB 


+++ 


+++ 




PF 




-H- 


PA240 


TBAB spec 


+ 


+++ 




PF 


++ 


++ 


PA241 


TBAB cam 


+++ 


+-H- 




PF 


++ 


++ 


CU550 


TBAB 


+++ 


+++ 




PF 


++ 


-Hr 


PA256 


TBAB spec 




+++ 




PF 




++ 


PA258 


TBAB spec, cam 




+++ 




PF 




++ 


♦Each 4 


represents about 1 mm of colony diametei 



Thus, mutating both panEl and ilvC results in pantothenate auxotrophy, while 
mutating only panEl does not, similar to what has been reported for Ecoli and S. 
10 typhimurium. 

Next, the quantitative effect of panEl and panE2 knockouts in a pantothenate 
overproducing strain (PA235 described herein) was examined. The panEl and panE2 
disruptions were introduced into PA235, either singly or together to produce PA245 
(Apan£i::spec), PA248 (Apan£2::cat) andPA244{Apafl£7::cat, ApanE2::spec). The 
1 5 effect of each mutation on pantothenate production was then tested in liquid test tube 
cultures. 
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Table 4. Pantothenate production by PA235 derivatives containing panEl and 
panE2 dismptiom. 



btrain 


JY>an1 ms/L 


% of PA235 


PA235 


990 


(100) 


PA235 


940 


95 


PA245 


59 


6 


PA245 


82 


8 


PA248 


1060 


106 


PA248 


1030 


104 


PA244 


25 


3 


PA244 


50 


5 



Thus, deletion analysis indicated that ite panEl gene contributes to over 90% of 
5 the pantothenate production, while deletion of panE2 did not have a significant effect on 
pantothenate production. It is therefore concluded that panEl accounts for most, but not 
necessarily all, of the ketopantoate reductase activity in B. subtilis. The rest of the 
ketopantoate reductase activity is predicted to be supplied by ilvC. 

Having identified panEl as an important gene for pantothenate production, 
10 increased^/ expression was tested to determine whether it could enhance 
pantothenate production in strains such as PA221 or PA235. The panEl coding 
sequence was installed downstream of the P 26 promoter and RBS2 in a vector, pOTP61, 
designed to integrate and amplify at either the bpr locus (a non-essential protease gene) 
or at the locus of the cloned insert. The resulting plasmid, pAN236 (Figure 4) was 
1 5 transformed into PA221 , selecting for resistance to tetracycline at 1 5 mg/L. The 
nucleotide sequence of pAN236 is set forth as SEQ ID NO:77. One transformant, 
named PA236 was chosen for further study. 

PA236 was shown to overexpress a protein of about 31,000 daltons, which is 
close to the expected molecular weight of 33,290 daltons for panEl protein. Briefly, 
20 whole cell extracts were prepared from PY79, RL-1, PA221, PA221/pOTP61and PA236 
(2 samples). Cell extracts were separated by gel electrophoresis and the gels were 
coomassie stained to visualize proteins. In cells engineered to overexpress panE 
(PA236-1 and P A236-2), a band was visible having an approximate molecular weight of 
-31,000 daltons (as compared to molecular weight markers). Moreover, PA221 and 
25 PA236 expressed increased levels of a -29,000 dalton band, corresponding to ihepanB 
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gene product, and a -39,000 dalton band, presumably corresponding thepa/iC gene 
product. Furthennore, E. coli transformed with pAN006 (Figure 3B)-expr-essed bands 
correlating to the panB and panCgene products and £. coli transfected with PAN236 
expressed a -3 1 ,000 dalton band corresponding to the panE gene product. 

5 Next, PA236 was compared to PA221 carrying the empty vector pOTP61 for 

pantothenate production in liquid test tube cultures supplemented with 5 g/L ^-alanine 
and 5 g/L a-KIV. 

Table 5. Effect of overexpression of panEl and panE2 on pantothenate 
10 production by engineered strains in liquid test tube cultures. 



Strain 


Additional 


Gene 


(Pantothenate] 




Plasmid 


Overexpressed 


mg/L 


PA221 


pOTP61 


none 


1,000 








940 


PA236 


pAN236 


panEl , 


2,030 








2,050 


PA238 


P AN238 


panE2 


530 








680 



Overexpression of panEl caused a two-fold increase in pantothenate production 
when compared to the parent strain (e.g., to slightly over 2 g/L) whereas overexpression 
of panE2 resulted in a strain that produced about 35% less pantothenate than the parent 
1 5 strain. The panEl nucleotide sequence and predicted amino acid sequence are set forth 
as SEQ ID NO:29 and SEQ ID NO:30. 

EXAMPLE III: Enhanced Production of a Panto-Compound by Culturing 
Bacteria Overexpressing panEl or panBCD in the Presence of Valine. 

20 The ability of valine to function as a media supplement (e.g., as a substitute for 

a-KIV) in strains engineered to overexpress the panBCD operon and panEl was 
evaluated. Valine is closely related to a-KIV by transamination, is less -expensive than 
a-KIV, and is commercially available in kilogram quantities. Valine was -substituted for 
a-KIV in the standard liquid test tube cultures in SVY medium. The concentration of 

25 valine was varied from 5 to 50 g/L. Although valine at 5 g/L was slightly less efifective 
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than a-KTV in promoting pantothenate production, valine at 10 or 20 g/L equaled or 
surpassed 5 g/L a-KTV in promoting pantothenate production. 

EXAMPLES IV-X Generation of Microorganisms Capable of Producing 
5 Pantothenate in a Precursor-Independent Manner 

B. subtilis strains such as PA221 and PA235 (engineered to overexpress 
panBCD) and PA236 (engineered to overexpress panBCD endpanEl) need to be fed a- 
ketoisovalerate (a-KIV) (or valine) and aspartate (or p-alanine) to achieve maximal 
pantothenate production, as both these precursors are limiting for pantothenate 
1 0 synthesis. Accordingly, manipulated microorganisms were designed to eliminate the 
need to feed limiting precursors of pantothenate biosynthesis in the production of 
pantothenate. These strains are also useful in the production of various pantothenate 
biosynthetic pathway intermediates. 

1 5 EXAMPLE IV: Generation of Microorganisms Capable of Producing 
Pantothenate in an Aspartate- (or JJ-Alanine) Independent Manner 

The panD gene was cloned into B. subtilis expression vector pOTP61 to 
construct pAN423 (Figure 5). The nucleotide sequence of pAN423 is set forth as SEQ 
ID NO:78. The Notl restriction fragment containing panD was isolated from pAN423, 
20 self ligated and used to transform PA221 . Transformants resistant to Tet 15 , Tet 30 , and 
Tet 60 were isolated and saved for further analysis. 

Six of the pAN423 transformants plus two control transformants were grown in 
SVY containing 5 g/1 a-KTV with and without 10 g/1 aspartate and then assayed for 
pantothenate production (Table 6). 

25 

Table 6. Effect of overproducing PanD on pantothenate production with and without 
added aspartate. 



Culture* 
(PA221 transformants) 


Asp 
(10 g/L) 


TetR** 
(Hg/ml) 


OD550 


[pan] 

(mg/L) 












pOTP61-l 




60 


8.0 


76 


pOTP61-2 




60 


7.7 


91 


423#1-1 




15 


8.5 


180 


423#l-2 




15 


8.0 


150 


423#l-3 




30 


8.3 


220 


423#l-4 




30 


8.5 


280 


423#l-5 




60 


8.9 


580 


423#l-6 




60 


8.8 


280 
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pOTP61-l 


+ 


60 


7.5 


380 


pOTP61-2 




ou 


£ Q 

O.y _ 


jOU 


423#M 


+ 


IS 


8.5 


1 1200 


423#l-2 


+ 


15 


8.6 


1000 


423#l-3 " 


+ 


30 ' 


8.8] 


1200 


423#l-4 




30 


9.0] 


1200 


423#l-5 


+ 


60 


9.0 ; 


1200 


423#l-6 


,+ 


60 


9.0 


1200 



*Test tubes cultures were grown in SVY + a-KTV <5 g/L) with Asp (10 g/L) where 
indicated. 

**TetR = Approximate Tet-resistance of transformant 



The pAN423 transformants produced at least twice the amount of pantothenate 
5 as the controls (f. e. , to a level at or near that which was obtained in earlier experiments 
by the addition of P-alanine to the culture medium). The data also show that in the 
absence of added aspartate, transformants containing additional copies of the panD gene 
expression cassette produce more pantothenate than the-control transformants. One of 
the transformants, 423#l-5, produced about five times as much pantothenate as the 
10 , controls. These results indicated that increased levels of PanD protein "pull" the 
conversion of available aspartate towards ^-alanine, and that increasing panD gene 
expression can result in enhancement of pantothenate production both in the presence 
and absence of added aspartate. 

Transformant 423#l-5 was re-named strain PA401 and studied further in shake 
1 5 flask fermentations. The shake flask medium was SVY with maltose instead of SVY 
with glucose. Results of shake flask experiments agreed well with test tube experiments 
during the first 24 hours. In shake flask experiments without the addition of p-alanine, 
PA401 produced approximately 1 .5 g/1 of pantothenate in 24 hours. Addition of 
alanine to the culture medium did not further improve pantothenate titers {Table 7), 
20 indicating that with this strain and these fermentation-conditions, ^-alanine is not 

limiting pantothenate production. In fact, when no P-alanine is fed, one can observe that 
PA401 is secreting P-alanine in significant amounts into the medium. 



BNSOOCtO: <WO 0121772A3 IA> 



WO 01/021772 PCTAJS00/25993 

-71- 

Table 7. Shake flask cultures with strain PA401 (panD) with and without B-alanine. 





Amino acids (g/l) 


24 hours 


Initial 

p-ala 
Added 




Val 


pH 


OD600 


Pantothenate 
(g/D 


0 


0.7 


1.5 


7.5 


13.7 


1.5 


5g/I 


7.1 


1.4 


7.6 


12.4 


1.5 



Each value represents the average of duplicate 250 ml baffled flasks containing 50 ml of 
5 medium, incubated at 37°C with shaking (200 rpm). 
Base Medium: SVY with 10 g/l cc-KIV, 30 g/l maltose 
2% Inoculum: SVY with Tet 15 grown 24 hours. ' 

EXAMPLE V: Engineering the panD gene for Further Increased Synthesis of 
10 Aspartate Decarboxylase and Enhanced Production of Pantothenate 

This Example describes the generation of improved ribosome binding sites 
(RBSs) in the panD gene to increase the translation of panD mRNA. 
Increasing the translation of the panD Rene mRNA by generation of synthetic panD 
RBSs 

1 5 The RBS (SEQ ID NO:88) used to express panD in pAN423 is a synthetic RBS 

and has been used to successfully produce other proteins in B. subtilis at a high level. 
However, it contains six mismatches when aligned to the "ideal" B. subtilis RBS (SEQ 
ID NO:45) {e.g. , an RBS having a sequence which is complementary to the 16S RNA 
sequence within the B.subtilis ribosome). (See e.g. , Table IB, mismatches in bold). 

20 Two new RBSs were designed to more closely mimic the ideal RBS. These synthetic 
RBSs, named new design A (NDA) and new design B (NDB) <also referred to herein as 
RBS3 and RBS4), are set forth as SEQ ID NO:51 and SEQ ID NO.S2 and are aligned 
with the ideal RBS in Table IB. 

Oligonucleotides corresponding to the top and bottom strands of each new RBS 

25 were synthesized, annealed, then used to replace the RBS in pAN420, generating 
plasmids pAN426 and pAN427. These constructions are illustrated in Figure 6. The 
presence of the NDA and NDB RBS in pAN426 and pAN427 was comnrmed by DNA 
sequence analysis. Next, the panD genes from pAN426 and pAN427 were transferred 
to B. subtilis expression vector pOTP61 as shown in Figure 7, creating pAN428 and 
30 pAN429. The nucleotide sequence of pAN429 is set forth as SEQ ID NO:79. 
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Notl restriction fragments lacking the E. coli vector, sequences were isolated 
from P AN428 and pAN429, self-ligated, and used to transform strain PA221 to 
resistance to Tet 15 . Four isolates resistant to Tet 60 were picked from each transformation 
and assayed for pantothenate and p-alanine production along with PA221 transformed 
with the empty vector <pOTP61) and PA221 transformed with pAN423 {strain PA401) 
(see Table 8). 



10 



Table 8. 



Plasmid 



Panthothenate production by test tube cultures ofPA221 transformed 
with pAN428 and pAN429 



Medium 
Supplements 



OD550 



Pan 

■sn 



g/1 



pOTP61 
pAN423 

pAN428-l * 
pAN428-2 
pAN428-3 
pAN428-4 

pAN429-l 
pAN429-2 
pAN429-3 
pAN429-4 # 



a-KTV J 
a-KIV 5 

a-KJV 5 
a-KIV 5 
a-KIV 
a-KIV 5 

a-KIV 5 
a-KIV 
a-KIV 5 
a-KIV 5 



10 
10 

12 
11 
11 
10 

12 
11 
11 
12 



0.4 

0.6 
0.5, 
0.3 
0.1 

0.6 
0.5 
0.6 
0.8 



0:04 
0.04 

0.04 
0.03 
0.03 
UND 

0.04 
0.04 
O.05 
0.10 



pOTP61 
pAN423 

pAN428-l * 
pAN428-2 
pAN428-3 
pAN428-4 

pAN429-l 
pAN429-2 
pAN429-3 
P AN429-4 # 



a-KIV 5 + Asp 10 
a-KTV 5 + Asp 

a-KIV 5 + Asp 10 
5 10 
a-KTV 0 + Asp 1 " 

a-KTV 5 + Asp 10 

a-KIV 5 + Asp 10 

a-KTV 5 + Asp 10 
„„,5 . a 10 
a-KIV + Asp 

5 10 
a-KTV^ + Asp " 

a-KTV + Asp 



11 
12 

12 
12 
12 
11 

12 
12 
12 
13 



0.5 
0.9 

0.8 
0.8 
0.9 
0.8 

0.8 
0.8 
0.8 
0.8 



0.08 
1.32 

1.97 
1.51 
1.02 
0.30 

1.78 
1.66 
1.78 
2.28 



UND: Below the 



units of detection. * Renamed PA402 # Renamed PA403 



When grown in medium supplemented with a-KTV at 5 g/l<a-KI V 5 ), the 
pAN428-l transformant and all four of the pAN429 transformants produced more 
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pantothenate than did PA401, suggesting that these transfonnants contain higher levels 
of aspartate decarboxylase activity. When grown in medium supplemented with a-KIV 5 
and Asp 10 none of the pAN428 or pAN429 transfonnants produced more pantothenate 
than PA401 . However, the pAN428-l transformant and all four of the pAN429 
5 transfonnants produced significantly more p-alanine than did PA401 . It is possible that 
the excess ^alanine produced from added aspartate causes inhibition of pantothenate 
production. Alternatively, P-alanine may accumulate because pantoate is limiting in 
these strains. 

The strains that produced the highest level of P-alanine, the pAN428-l and 
1 0 pAN429-4 transfonnants, were renamed PA402 and PA403, respectively. These two 
strains were grown in SVY medium supplemented with various intermediates and 
reassayed for pantothenate and p-alanine production. PA221 and PA401 were included 
as controls. The results of the assays are presented in Table 9. 

1 5 Table 9. Pantothenate production of PA402 and PA403 in test tube cultures. 



Strain 


Medium 
Supplements 


OD55O 


Pan 

en 


p-Ala 


Val 

g/1 


PA221 
PA401 
PA402 
PA403 


a-KTV 5 
a-KTV 
a-KIV 
a-KIV 5 


7.9 
8.7 
8.5 
9.4 . 


UND 
0.3 
0.5 
0.7 


UND 
0.04 
0.04 
0.07 


0.9 
0.9 
0.9 
0.9 


PA221 
PA401 
PA402 
PA403 


a-KTV 5 + Asp|° 
a-KTV 5 + Asp™ 
a-KTV 5 + Asp 
a-KTV 5 + Asp 10 


9.8 
9.1 
9.4 
9.7 


0.4 
0.8 
0.8 
0.7 


0.11 
1.15 
2.02 
2.40 


0.8 
0.8 
0.8 
0.8 


PA221 
PA401 
PA402 
PA403 


Pantoate 5 
Pantoate^ 
Pantoate^ 
Pantoate 


8.9 
8.7 
10.6 
10.5 


UND 
0.3 
0.5 
0.7 


UND 
0.02 
0.02 
0.02 


0.2 
0.2 
0.2 
0.2 


PA221 
PA401 
PA402 
PA403 


Pantoate 5 + Asp jjj 
Pantoate. + Asp^ 
Pantoate + Asp 1Q 
Pantoate +Asp 


9.5 
9.2 
9.1 
10.2 


0.4 

2.2 
2.8 
2.9 


0.06 
0.62 
1.17 
1.58 


0.2 
0.2 
0.2 
0.2 
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UND: Below the limits of detection. 

When grown in medium supplemented with either a-KTV 5 or Pantoate 5 , PA402 
and PA403 produced significantly more pantothenate than did PA401. As before, even 
though PA402 and PA403 produced significantly more (5-alanine than PA401 when 
5 grown in medium supplemented with a-KTV 5 and Asp 10 , they did not produce a 

proportional increase in pantothenate. However, when grown in medium supplemented 
with Pantoate 5 plus Asp 10 , both P A402 and P A403 produced significantly more 
pantothenate than PA40 1 , about a 30% increase. 

It can be concluded from these experiments that the improved NDA and NDB 
1 0 panD ribosome binding sites, engineered into pAN428 and pAN429, respectively, lead 
to increased levels of aspartate decarboxylase activity. 

Increasing the translation of the panD gene mRNA by generation of synthetic panD 
RBSs within the panBCD operon 

15 

The native B. subtilis panD gene ribosome binding site (RBS)<SEQ ID NO:43), 
which is found in the P 2 6panBCD operon cassette present in PA221 (and in other 
engineered pantothenate production strains described herein), is shown in Table 1C 
aligned with the ideal ribosome binding site<SEQ ID NO:47). The alignment shows 

20 mismatches between the native 5. subtilis panD^gvne RBS, which is located within the 
coding sequence for PanC, and the the ideal RBS. Three new RBSs {within the P26 
panBCD operon cassette) were generated to increase translation of the panD gene , 
mRNA and to yield increased synthesis of aspartate decarboxylase. These synthetic 
RBSs (termed NDI, NDH, and NDIII, also referred to herein as RBSS, RBS6 and RBS7, 

25 respectively) are set forth as SEQ ID NO:55, SEQ ID NO:56 and SEQ ID NO:57, 

respectively) and are included in Table 1C. It should be noted that although changes in 
the panD RBS within the panBCD operon also changes the C-terminal amino acid 
sequence of the PanC protein encoded by that operon, an alignment of known and 
suspected PanC protein amino acid sequences showed that the sequence of the last nine 

30 amino acids of the B. subtilis PanC protein could be altered without affecting any 
conserved amino acid residues indicating that such changes should not reduce 
pantothenate synthetase activity or expression. The new RBSs were synthesized and 
incorporated into the P26 panBCD operon expression cassette as follows. 

First, PCR primers were designed to contain the following elements: (1) a 
35 nucleic acid sequence encoding the first five amino acids of PanD up to and including a 
unique BsiWl restriction site that had been previously introduced into panD by PCR;<2) 
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a stop codon for panC, (3) at least one synthetic RBS; and (4) 30-39 bp of nucleic acid 
sequence having 100% identity with panC upstream of the panD RBS. The primers 
were named TP102, TP103, and TP104 and contain the NDI, NDII, and NDIII ribosome 
binding sites, respectively. These three primers were used in conjunction with the 5' 

5 primer TP101 , which hybridizes near the start codon of panC, in three independent PCR 
reactions to generate the NDI, NDII, and NDIII PCR products. The PCR products were 
purified, digested with Xbal, then cloned into plasmid vector pASK-lBA3 which had 
been digested with Xbal and Smal. The resulting plasmids were named pAN43 1 , 
pAN432, and pAN433. The construction of pAN431 is illustrated in Figure 8 and is 

10 representative of all three plasmid constructions. The presence of the desired synthetic 
panD gene RBS in each new plasmid was confirmed by DNA sequencing. 

Next, the modified panC genes containing the new panD RBSs were joined with 
the panD gene utilizing the unique BsiVfl restriction site. This was accomplished by 
isolating the appropriate Nsil-BsiWl restriction fragments from pAN431, pAN432, and 
1 5 pAN433 and ligating them with a 2395 bp NsiL-BsiWl restriction fragment from 

pAN420, which supplied the As/WI-modified panD gene. These constructions resulted 
in plasmids pAN441, pAN442, and pAN443, respectively. A representative 
construction (pAN441) is illustrated in Figure 9. The nucleotide sequence of pAN443 is 
setforthasSEQIDNO:80. 

20 The new panD gene RBSs were then substituted into the P 2 ^panBCD operon 

expression cassette as follows. First, a deletion-insertion mutation which removes the 
region of panC containing the panD RBS was created. This was constructed by 
digesting pAN430 with a mixture of BspEl and BgRl and recovering the 4235 bp 
fragment which is now missing the 3 1 end of panC and the 5 1 end of panD. This 

25 fragment was ligated with an Aval-Bamltl restriction fragment from plasmid pECC4, 
which contains the chloramphenicol acetyl transferase (cat) gene. The 5' extension 
produced by Aval digestion is compatible with that produced by BspEI while the Bglil 
and BaniHI extensions are also compatible. The resulting plasmid was named pAN440, 
and its construction is illustrated in Figure 10. 

30 The resulting deletion-insertion mutation was crossed into the P26 panBCD 

operon via homologous recombination by transforming PA221 with linearized pAN440 
and selecting for resistance to chloramphenicol on Cam 5 plates containing 1 mM 
pantothenate. Several transformants were tested, and were all found to require 1 mM 
pantothenate for growth, as expected. Two of these transformants were remaned 

35 PA408A and PA408B and were assayed for pantothenate production. Neither strain 
synthesized measurable quantities of pantothenate, even when grown in medium 
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containing pantoate and P-alanine at 5 g/1, indicating that the strains are deficient in 
pantothenate synthetase activity. Next, the new panD RBSs were crossed into the P 2 e 
panBCD operon by transforming PA408 with linearized pAN441, pAN442, and 
pAN443 plasmid DNA and selecting for growth on TBAB plates without pantothenate 
5 supplementation. A transformation with linearized pAN430 "(including the native panD 
RBS) was included as a control and was expected to give rise to transformants identical 
to PA221 described herein. Four isolates from each transformation were assayed for 
pantothenate and p-alanine production in SVY medium supplemented with various 
intermediates (Tables 10 and 1 1). 

10 

Table 10. Pantothenate production of PA4W - PA413 in test tube cultures. 



Strain 


RBS 


Medium 


ODsSO 


Pan 


■P-Ala 


i 




Supplements 






g/1 


r/\ZZ 1 

» 


. 

native 


IT <Xxi\\)<Xv\^ 


11 


UND 


UND 


PA410-1 


native 


5 

Pantoate^ 


12 


UND 


UND 


PA410-2 




Pantoate^ 


12 


UND 


UND 


PA410-3 




Pantoate^ 


12 


UND 


UND 


PA410-4 




Pantoate 


12 


UND 


UND 


PA411-1 


NDI 


Pantoate^ 


12 


0.23 


UND 


PA411-2 




Pantoate^ 


12 


0.20 


UND 


PA411-3 




Pantoate^ 


12 


0.19 


UND 


PA4U-4 




Pantoate 


12 


UND 


UND 


PA412-1 


NDII 


Pantoate,^ 


12 


UND 


UND 


PA412-2 




Pantoate^ 


11 


UND 


UND 


PA412-3 




Pantoate^ 


13 


0.18 


UND 


PA412-4 




Pantoate 


12 


0.18 


UND 


PA413-1 


ND1II 


Pantoate^ 


. 12 


0.18 


UND 


PA413-2 




Pantoate^ 


12 


0.17 


UND 


PA413-3 




Pantoate^ 


12 


0.16 


UND 


PA413-4 




Pantoate 


12 


0.17 


UND 



UND: Below the limits of detection. 



15 
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Table 11. Pantothenate production ofPA410 - PA413 in test tube cultures. 



Strain 


RBS 


Medium 
Supplements 


OD55O 


Pan 


p-Ala 
g/1 


PA221 


native 


Pamoate 5 + Asp 10 


11 


0.3 


UND 


PA410-1 

PA/11H 9 

PA410-3 
PA410-4 


native 


Pantoatef. + Aspj^ 

Pantnutf* -4- A Qfl 

Pantoate- + Asp^ 
Pantoate +Asp 


12 
12 
12 
12 


0.4 
0.4 
0.4 
0.4 


UND 
UND 
UND 
UND 


PA411-1 

t) A /111 O 

rA41 1-Z 

PA411-3 
PA411-4 


NDI 


Pantoate^ + Asp jg 

JrolliOaie^ • /\sp^Q 

Pantoate + Asp 1Q 
Pantoate +Asp 


13 
13 
13 
13 


1.7 
1.7 
1.8 
0.4 


0.4 
0.4 
0.3 
UND 


PA412-1 
PA412-2 
PA412-3 
PA412-4 


NDII 


Pantoate^ + Asp jjj 

Pantoate- + Asp in 
5 10 
Pantoate^ + Asp ^ 

Pantoate + Asp 


13 
12 
12 
12 


0.4 
0.4 
1.6 
1.5 


UND 
UND 
0.3 
0.2 


PA413-1 
PA413-2 
PA413-3 
PA413-4 


NDIII 


Pantoate^ + Asp 
Pantoate 5 + Asp 10 
Pantoate 5 + Asp 1Q 
Pantoate +Asp 


13 
13 
13 
13 


1.6 
1.6 
1.7 
1.7 


0.3 
0.4 
0.4 
0.4 



UND: Below the limits of detection. 



5 As expected from previous experiments using P A22 1 , none of the transformants 

that contained the native panD RBS produced measurable quantities of pantothenate 
when grown in medium supplemented with pantoate. However, nine of the twelve 
transformants expected to contain modified panD RBSs produced significant quantities 
of pantothenate (160-230 mg/1) under these conditions, indicating that they possess 

1 0 elevated levels of aspartate decarboxylase activity. When grown in medium 

supplemented with both pantoate and aspartate, these same nine transformants produced 
approximately four times more pantothenate than those with the native panD RBS. In 
addition, these nine transformants accumulated measurable quantities of fJ-alanine (230- 
410 mg/1). All transformants produced roughly equivalent quantities of pantothenate 

1 5 when grown in medium containing pantoate and p-alanine, demonstrating that each 
contains a functional pantothenate synthetase. 



WO 01/021772 



i 



PCIYUS00/25993 



-78- 

These data demonstrate that the synthetic panD RBSs are about four times more 
effective than the native panD RBS in directing translation of thepanflgene mRNA and 
evidence the utility of such synthetic RBSs in^enhancing pantothenate production. 
Additional approaches to increasing pantothenate production can include, for example, 
5 increasing the half-life of the panD gene mRNA, increasing the strength of the promoter 
for panD transcription and/or increasing the stability of the PanD protein. 

EXAMPLE VI: Construction of Strains Containing an Integrated P 26 panEl 
Cassette without an Antibiotic Resistance Gene. 

10 

Example II describes the identification of the B. subtilis panEl gene that encodes 
the enzyme responsible for the majority of the ketopantoate reductase activity in B. 
subtilis. PA236 (containing the pAN236 plasmid) produced about twice as much 
pantothenate (2g/l) as its parent strain, PA221 (1 g/1) in 24 hour SVY test tube -cultures. 
15 PA236 was presumed to contain an amplified (~3 copies) integrated pAN236 plasmid 
based on selection for tetracycline resistance (the tetR gene product being encoded on 
the pAN23 6 plasmid in addition to the P 26 panEl cassette). Also useful in the 
methodologies of the present invention are strains that contain a single integrated 
unamplifiable copy of P 26 panEl at the panEl locus, for example, without an antibiotic 
20 resistance gene in the strain. Such a strain was generated as follows. 

A plasmid named pAN251 was derived frompAN236 by inserting additional 
chromosomal sequences just upstream and just downstream from the P 26 panEl cassette. 
These additional sequences, which provide homology to allow integration of the P 26 
panEl cassette at panEl by double crossover, were obtained by PCR from chromosomal 
25 DNA as a template. pAN251 is shown in Figure 1 1 . The nucleotide sequence of 
pAN251 is set forth as SEQ ID NO:81. 

Next, a strain was constructed which allowed selection for the incoming P 2 e 
panEl cassette. The strain included the following three components: (1) PupanBCD; 
(2) ApanEl ; and (3) ilvC, since both panEl and i/vC must be mutated to have a Pan" 
30 phenotype. The starting strain was CU550 {trpC2, ilvC4, 4euC124). The P 26 panBCD 
cassette from PA221 chromosomal DNA was introduced in two steps to create strain 
PA290. Next, ApanEl r.spec was transformed into PA290, using chromosomal DNA 
from strain PA240, to give strain PA294 (trpC2, ilvC4, leuC124, P 26 panBCD, 
ApanEl r.spec), which is a strict pantothenate auxotroph. Finally, PA294 was 
35 transformed with plasmid pAN251, selecting for pantothenate prototrophy, to give strain 
PA303. This strain was expected to have the genotype trpC2, ilvC4, leuC124, P 2 e 
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panBCD, P 2 e panEl. PA303 was checked for the correct chromosomal structure at the 
panEl locus by PCR using primers that flank the P 2 6 insertion just upstream of panEl. 
The PCR product from PA303 was of the expected size, with a concomitant loss of the 
PCR product from the wild type panEl gene, consistent with having obtained the 
5 ' desired double crossover event. Furthermore, PA303 was tetracycline sensitive, which 
is also consistent with the desired double crossover event, as opposed to a Campbell- 
type single crossover of the plasmids into the chromosome. The trp, tfv, and leu 
auxotrophies from the parent strain were all maintained in PA303 . 

In 24 hour liquid SVY test tube cultures, PA303 produced almost the same level 
10 of pantothenate as positive control PA236, and about twice as much as PA221, which 
does not contain engineered panEl as indicated in Table 12. 

Table 12, Pantothenate production by 24 hr. test tube cultures ofPA303 and 
consols grown in SVY plus 5 g/l a-KIVand 5 g/l fi-alanine. 



Strain 


OD600 


[pan] g/l 


PA221-1 


10.9 


0.85 


PA221-2 


10.5 


0.85 


PA236-1 


9.5 


1.74 


PA236-2 


9.3 


1.70 


PA303-1 


10.8 


1.66 


PA303-2 


10.7 


1.61 



EXAMPLE VII: Generation of Microorganisms Capable of Producing 
Pantothenate in an a-KIV (or Valine) Independent Manner 

20 a-ketoisovalerate (a-KIV) is a rate limiting intermediate for pantothenate 

production in certain strains deregulated for pantothenate synthesis. Addition of either 
a-KTV or valine at 5 g/l increases pantothenate production about 5-fbld in test tube 
cultures with strains such as PA221 . In order to alleviate the need to feed either a-KIV 
or valine, strains were engineered that have an increased capacity to synthesize a-KTV. 

25 a-KTV is produced in B. subtilis from pyruvate by the sequential action of three 

enzymes encoded by four genes, ilvB and ilvN, ilvC, and ilvD. In a wild type B. subtilis, 
three of the genes {ilvB, ilvN, and ilvQ are the first three genes of the large ilv-leu 
operon. The fourth gene necessary for a-KIV synthesis, z7vZ), is located by itself 
elsewhere on the chromosome. Hie B. subtilis ilv-leu operon is thought to be regulated 
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only by leucine levels. Feeding of exogenous leucine reduces transcription of the ilv-ku 
operon by about 13-fold, probably by an attenuation mechanism (Grandoni et al. (1992) 
J. Bacteriol. 1 74: 3212-3219). The only known feedback regulation in the ilv-leu 
pathway is the inhibition of the leuA gene product by leucine. 

5 As a first step to deregulate the synthesis of a-KTV, a copy of the ilvBNC region 

from the wild type B. subtilis ilv-ku operon was isolated by PCR, and installed adjacent 
to the P 26 promoter and RBS2 on a vector, pOLL8, that was designed to integrate a 
single ? 2 tf expression cassette by double recombination at the amyE locus. The amyE 
gene encodes a nonessential a-amylase, and is a useful locus for installing expression 

10 cassettes. The resulting plasmid, pAN267, is illustrated in Figure 12. The nucleotide 
sequence of pAN267 is set forth as"SBQ ID NOS2. pAN267 readily gave stable 
transformants by double crossover at the amyE locus of B. subtilis strains, as described 
in detail below. 

Construction of pantothenate overproducing strains that are leucine. prototrophs 

1 5 Initially, a B. subtilis strain containing ilvC4 and ApanEl was used to introduce a 

single copy of P 26 panEl into the chromosome without using an antibiotic resistance 
gene. The double mutant was required to select for the incoming P 26 panEl cassette 
because a ApanEl mutation alone does not result in pantothenate auxotrophy. A strain 
named CU550 was obtained containing ilvC4 to be used as a basis for this type of strain 

20 construction. However, CU550 also contains a closely linked leuC124 mutation, so all 
strains derived from CU550 required leucine. Having shown mat the combination of P 2 6 
panBCD and P 26 panEl was favorable for pantothenate production, the next step was to 
reassemble this combination of two cassettes in a leucine prototroph. 

Accordingly, die two cassettes were combined in two different strain 
25 backgrounds, RL-1 and PY79. To introduce chromosomal P 26 panEl into the PY79 and 
RL-1 strain backgrounds without using an antibiotic resistance gene, a strategy was used 
that did not rely on ilvC4. (The strategy took advantage of the observation that the 
ApanEl mutation causes a pantothenate bradytrophy, manifested by relatively small 
colonies on TBAB (rich) plates). First, ApanBr.cat and ApanE: .spec were introduced 
30 into both strain backgrounds. Next, the resulting strains were transformed 

simultaneously with DNA from two strains, PA221 (P 26 panBCD) and PA303 {Pu 
panEl), selecting for Pan* on TBAB plates. Colonies of two distinct sizes grew on the 
selective plates, with the larger size comprising about 2% of the colonies. The larger 
colonies were presumed to represented co-transformants mat received both P 26 panBCD 
35 and P 26 panEl, and that the smaller-colonies had-received-only P 26 panBCD. Consistent 
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with this prediction, the larger colonies had lost both Cam r and Spec', while the smaller 
colonies had lost only the cat gene, and retained the spec gene. Furthermore, a 
representative derivative of PY79 named PA327, and a representative derivative of RL- 
1, named PA328, both produced the elevated levels of pantothenate in test tube cultures 
5 which was about 1.6 to 1.7 g/1 (Table 13). 

Table 13. Pantothenate production of PA327, PA328, and controls from 24 hr 

test tube cultures grown in SVYplus 5 g/l a-KJV and fi-alamne. 



P26P<*nEl ropy 



Strain 


Background 


number 


[pan] g/I 


PA221-1 


RL-1 


0 


0.92 


PA221-2 


RL-1 


0 


0.95 


PA236-1 


RL-1 


amplified (~3) 


1.60 


PA236-2 


RL-1 


amplified (~3) , 


1.73 


PA327-1 


PY79 


1 


1.66 


PA327-2 


PY79 


1 


1.65 


PA328-1 


RL-1 


1 


1.61 


PA328-2 


RL-1 


1 


1.91 



10 

Thus, PA327 and PA328 were concluded to contain both P 2 6 panBCD and P 2 6 
panEl, and were used for further constructions as described below. PCR analysis 
confirmed the presence of the two cassettes. 

15 Installation of a stable P26 ilvBNC cassette into two lineages of pantothenate 
overproducing strains 

Having constructed PA327 and PA328, derivatives of PY79 and RL-1 that 
contain P26 panBCD and P26panEl, and that are Leu + , the next step was to introduce 
stable copies of P26 ilvBNC. This was accomplished by transforming PA327 and 
20 PA328 with plasmid pAN267, selecting for Spec r . Screening by PCR showed that about 
85% of the obtained transfonnants contain P26 ilvBNC integrated at amyE by double 
crossover. One transformant of PA327, named PA340, and one transformant of PA328, 
named PA342, were chosen for further study. 

In test tube cultures grown in SVY medium plus 5 gA p-alanine but without 
25 added a-KIV, both PA340 and PA342 gave the expected increase in pantothenate 
production over that of PA327 and PA328, to about 1 .3 to 2 g/l (Table 14). 
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Table 14. Pantothenate and valine production by PA340 and PA342, both 

containing? 2 $ilvBNC in 24 hr test tube cultures grown in SVYwith 5^/1 
^-alanine and with or without 5 g/l a-KIV 



Strain 


Back- 
ground 


OD600 
-a-KIV 


+ a-KIV 


{pan] g/l 
-a-KIV 


+ a-KIV 


{val].g/l 
-a-KIV 


+ a-KW 


PA340-1 


PY79 


11.8 


7.1 


2.02 


2.10 ' 


0.38 


0.90 


PA340-2 


PY79 


10.3 


7.5 


1.97 


2.03 


0.40 


0.91 


PA342-1 


RL-1 


10.2 


8.0 


1.29 


1.89 


0.27 


0.78 


PA342-2 


RL-1 


9.6 


9.2 


1.34 


2.04 


0.21 


0.79 



The two new strains also gave aslight increase in valine secretion, indicating 
that the ilvBNC genes had been deregulated. However, when the same strains were 
grown with 5 g/l a-KIV added, a further increase in pantothenate production occurred 
1 0 from P A342, suggesting that a-KIV was still rate limiting in this strain background. 
Similar results, only with more growth and hence higher pantothenate levels, were seen 
in shake flask cultures (Table 15). 

Table 15. Pantothenate and valine production by PA340 and PA342 t both 
15 containing P 26 ilvBNC in 24 hour shake flask cultures grown in SJT with 

5 g/l ^-alanine and with or without 5 g/l a-KIV. 



Strain 


Back- 
ground 


OD600 
-a-KIV 


+ a-K!V. 


{pan] g/l 
-a-KIV 


+ a-KTV 


Ival] g/l 
-a-KIV 


+ a-KIV 


PA327 


PY79 


21 


22 


0.6 


3.0 


0.5 


1.3 


PA340-1 


PY79 


20 


20 


3. "5 


4.1 


1.0 


1.9 


PA340-2 


PY79 


22 


19 


3.0 


2.1 


0.8 


1.4 


PA328 


RL-1 


20 


16 


1.4 


2.7 


0.6 


1.3 


PA342-1 


RL-1 


17 


16 


3.3 


3.6 


0.9 


1.6 


PA342-2 


RL-1 


18 


18 


3.1 


4.2 


0.8 


1.4 
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EXAMPLE VIII: Increasing panD Copy Number in Strains Engineered to 
Overproduce panEl and the ilvBNC Gene Products Enhances Pantothenate 
Production 

Experiments where P-alanine was fed to cultures of engineered B. subtilis strains 
5 consistently showed that P-alanine was a rate limiting intermediate in pantothenate 

synthesis. The effect of adding additional copies of panD on pantothenate production in 
PA340andPA342 was examined. Strains PA340 and PA342 were transformed with 
chromosomal DNA isolated from PA401 with selection on plates containing 15 |ig/ml of 
tetracycline (Tet 15 plates). Transformants derived from each parent were patched onto 

1 0 Tet 60 plates to identify those which were likely to contain multiple copies of the 

expression cassette. Twelve transformants from each transformation which grew on 
Tet 60 were streaked for single colonies on this medium and then assayed in SVY 
medium test tube cultures for pantothenate production. One transformant from each 
group was found to produce greater than 300 mg/1 pantothenate in 24 hours. These two 

15 . transformants were saved and named PA404 (PA340 strain background) and PA405 

(PA342 strain background). Both strains were resistant to spectinomycin, indicating that 
the P26 ilvBNC expression cassette was still present at amyE. PCR analysis of 
chromosomal DNA isolated from each strain confirmed that the deregulated panEl gene 
had also been retained. 

20 . Next, PA404 and PA405 were evaluated in shake flask cultures which were 

grown in SVY medium containing maltose as the carbon source and supplemented with 
various intermediates. The cultures were grown for 24 and 48 hours and then assayed 
for pantothenate, P-alanine, and valine production. The results of this experiment are 
presented in Table 16. Analogous shake flask culture data for the parent strains (PA340 

25 and PA342) are included in the tables for comparison. 

Table 16. Pantothenate production by PA404 and PA405 in shake flask cultures 
after 24 hours 



Strain 


Medium 
Supplements 


OD600 


Pan 

g/1 


P-Ala 

g/1 


Val 

g/1 


PA340 


none 


20 


0.4 


<0.1 


1.0 


PA404 


none 


22 


1.8 


O.l 


0.7 


PA342 


none 


19 


0.3 


0.2 


0.7 


PA405 


none 


19 


1.4 


0.4 


0.5 
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PA340 
PA404 
PA342* 
PA405* 



P-alanine 
P-alanine 
P-alanine 
P-alanine^ 





18 


3:6 


32 


0.6 




18 


2.8 


5.1 


0.7 




17 


3.3 


3.3 


0.5 




19 


1.3 


6.5 


0.6 



Values are the average of duplicate flasks -except where indicated -by * 



In the absence of any medium supplementation, PA404 and PA405 made four to 
five times more pantothenate in 24 hours compared to their isogenic parent strains 
5 (Table 16). The supply of P-alanine was clearly limiting in the parent strains PA340 and 
PA342. Addition of amplified P26 panD greatly increased the supply of P-alanine. 



EXAMPLE IX: Deregulation of the B. subtitis ilvD Gene Enhances Pantothenate 
Production 

10 To deregulate expression of the ilvD gene^tandard procedures (described 

above) were used to integrate the constitutive P 2 6 promoter and an artificial ribosome 
binding site, RBS2, just upstream of the ilvD coding region. Hie i/vDgene maps by 
itself, unlinked to the ilvBNC pperon. First, a 2.4 kb region of the RL-1 chromosome 
that contains the ilvD coding region and 730 bp of upstream sequence was cloned by 

1 5 PCR into a low copy (about 15 per E. coli cell) vector called pOK12, to give plasmid 
pAN257, shown in Figure 13. 

Taking advantage of a natural £coRI^ite just upstream of the native ilvD gene 
promoter, and a natural Ncol site at the ilvD start codon, an artificial sequence 
containing P 26 and RBS2 was inserted into pAN257 to give pAN263 (Figure 14). The 

20 nucleotide sequence of pAN263 is set forth as^EQ ID NO:83. In parallel with this 
construction, the cat gene was also inserted into pAN257, between the same upstream 
EcoBl site and a Bghl site in the middle of the ilvD coding region, to give pAN261 , 
which is deleted for a large portion of the iivD&ve (Figure 15). 
Using pAN261 and pAN263, the P26 ilvD cassette could then be installed in the B. 

25 subtilis chromosome in two steps. In the first step, pAN261 is introduced by 

transformation, selecting for chloramphenicol resistance, and then confirming an Ilv 
phenotype. In the second step, pAN263 is introduced, selecting for Ilv + , checking for 
chloramphenicol sensitivity, and confirming correct local structure by PCR. 

pAN261 was first transformed into strain RL-1 {highly competent) to :give-sirain 
30 PA343 (AilvD::caf), and then chromosomal DNA from PA343 was used to transform 
PA340 and PA342 to Ilv" auxotrophy, yielding strains named PA348 and PA349, 
respectively. Chromosomal DNA is inherently more efficient than monomelic plasmid 
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in transforming B. subtilis. Similarly, pAN263 DNA was transformed into PA343 
(moderately competent) to give strain PA345 (P26 */vZ)), and then PA345 chromosomal m 
DNA was used to transform PA348 and PA349 to Ilv + prototrophy, yielding strains 
PA374 and PA354, respectively. 

5 As predicted, PA374 and PA354 gave further increases in pantothenate 

production, to about 2.5 to 2.9 g/1, in test tube cultures grown in SVY plus 5 g/1 P- 
alanine (Table 17). 

Table 17. Pantothenate and valine production by PA374 and PA354, containing P 2 6 
10 ilvD, and controls, in 24 hr test tube cultures grown in SVY with 5 g/l /?- 

alanine and with or without 5 g/l a-KIV. 









ODeoo 


[pan] 
g/1 


{val] 
g/1 


Strain 


Back- 
ground 


ilvD status 


o-KIV 


+ 


a-KIV 


+ 


a-KIV 


+ 


PA340 


PY79 


w.t. 


9.2 


9.0 


2.14 


2.23 


0.38 


0.90 


PA348 


PY79 


ilvD::cat 


11-7 . 


10.0 


0.19 


2.23 


0.19 


0.91 


PA374-1 
PA374-2 


PY79 
PY79 


P26HvD 


9.1 
8.2 


7.3 
7.7 


2.93 
2.99 


2.40 
2.36 


0.58 
0.60 


0.87 
0.95 


PA342 


RL-1 


w.t. 


10.2 


8.0 


1.29 


1.89 


0.27 


0.78 


PA349 


RL-1 


ilvD::cat 


8.1 


7.7 


0.17 


1.87 


0.22 


0.88 


PA354-1 
PA354-2 


RL-1 
RL-1 


P26ihD 
P26ilvD 


9.6 

7.5 


9.6 
8.2 


2.57 
2.48 


2.03 
2.24 


0.65 
0.64 


1.23 
0.97 



In the absence of added p-alanine, strains PA374 and PA354 produced only 
1 5 about 0.2 g/1 pantothenate in test tube cultures, indicating that PanD activity is 
significantly rate limiting. 

To alleviate this limitation, the amplifiable P 2 e panD cassette from strain PA401 
was installed. PA401 chromosomal DNA was transformed into PA374 and PA354, 
selecting for Tef at 15 mg/1, to yield strains PA377 and PA365, respectively. After 
20 transformants were obtained, the strains were streaked on plates containing 30 and 60 
mg/1 tetracycline to reamplify the copy number of the P 2 6 panD cassette integrated at the 
bpr locus. In test tube cultures grown in SVY without a-KIV or p-alanine, a substantial 
improvement in pantothenate titers over those of PA374 and PA354 was obtained 
(Tables 18 and 19). 
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Table 18. Pantothenate production by PA365, -containing amplified P 2 6panD, and 
controls, in 24 and 36 hr test tube -cultures grown in SVY-glucose without 







OD 6 oo 


{pan] g/1 




Rplpvant crf>nntvne 








36 hrs. 


Strain 


24hrs. 


36hrs 


24hrs. 


PA342-1-1 


w.t. ilvD 


11.7 


8.8 


b.d. 


0.27 


PA342-1-2 


w.t. ilvD 


12.8 


8.8 


b.d. 


026 


PA354-1-1 


P26ihD 


n.d. 


11.0 


n.d. 


0.19 


PA354-1-2 




n.d. 


8.4 


n.d. 


020 


PA365-1 


P26'l 1 »D,P26panD 


9.8 


10.0 


1.01 


2.07 


PA365-2 


P26ilvD.P26panD 


9.9 


10.4 


0.96 ' 


2.09 



n.d. = not determined; b.d. = below detection 



Table 19. Pantothenate production by PA3 77, containing amplified P 2 6 panD, and 
controls, in 27 hr test tube cultures grown in SVY-glucose or SVY- 
1 0 maltose, without a-KJV, and with or without fi-alanine. 







OD 60 o 


Strain 


Relevant genotype 


-P-ala 
Glucose 


+ P-ala 
Glucose 


- p-ala 
Maltose 


+ P-al* 
Maltose 


PA374-1 
PA374-2 


P26 ilvD 
P26 tlvD 


9.4 
9.2 


9.8 
9.6 


7.0 
6.6 


6.4 
6.3 


PA377-1 
PA377-2 


P26ilvD,P26panD 
P26ilvD,P26partD 


10.0 
10.5 ' 

[pan] g/1 


7.6 
7.8 


7.2 
9.4 


6.1 
5.4 


Strain 


Relevant genotype 


-P-ala 
Glucose 


+ P-ala 
Glucose 


-P-ala 
Maltose 


+P-ala 
Maltose 


PA374-1 
PA374-2 


P26 ih£> 
P26ihD 


0.04 
0.10 


2.76 
2.65 


0.14 
0.15 


1.31 
1.33 


PA377-1 
PA377-2 


P26iIvD,P26panD 
P26i!vD.P26panD 


1.25 
1.25 


2.76 
2.35 


1.26 
1.31 


1.10 
1.26 



In SVY with glucose, an increase in pantothenate production can still be 
1 5 achieved by feeding 5 gA (J-alanine suggesting that increasing panD expression further 
might increase pantothenate production. In SVY with maltose, no further increase in 
pantothenate was obtained by feeding fi-alanine suggesting that ^-alanine and/or 
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aspartate synthesis is suppressed by glucose. Strains PA377 and PA365 have been 
evaluated in 10 liter fermentors, where they typically produce above 20 g/1 pantothenate 
in 48 hours without supplemental p-alanine and a-KIV or valine, described in detail 
below. 

i 

5 

EXAMPLE X: 10 liter Fermentations of Pantothenate-Producing Microbes 

Engineering of the P 2 e UvBNC and P 26 i/vD cassettes to give strains PA342 and 
PAS 54 allowed the production of 22 and 26 g/1 of pantothenate, respectively, without 
the addition of valine or a-KIV to the fermentation medium (Table 20). At 48 hours, 
1 0 both strains had secreted about 0.5 g/1 of valine into the medium. 



Table 20. 10-liter fermentations of five pantothenate overproducing strains. 



Strain 


Medium 


Feed 
40% Glucose 
plus 


OD600 
48 hr 


Valine 
48 hours 
g/1 


p-ala 
48 hr 

g/1 


Pantothenate 
g/L 

36 hr 48 hr 72 hr 


PA 236 


SVYG 


50 g/1 P-ala 
25 g/1 a-KTV 


108 


added 


. added 


16 


19 


21 


PA 342 


SVYG 


50 g/1 p-ala 


92 


0.5 


added 


17 


22 




PA 354 


SVYG 


50 g/1 P-ala 


90 


0.5 


added 


19 


26 




PA 365 


SVYG 


25g/lYE 


77 


0.85 


0.4 


18 


21 


27 


PA 377 


SVYG 


25g/lYE 


85 


1.5 


0.5 


18 


22 


31 


PA 377 


PFMG 


25g/lYE 


96 


0.8 


0.4 


19 


25 


29 


PA377 


PFMG 




71 


0.7 


0.1 


16 


21 





15 Pantothenate synthesis in fermentors 

With the addition of the P 26 panD cassette to strains PA354 and PA374 to create 
strains PA365 and PA377, neither ^-alanine nor a-KTV needed to be added to the 
fermentors. Strain PA365 produced 21 g/1 pantothenate in 48 hours and 27 g/1 in 72 
hours with no precursors added to the medium (Table 20). PA377 was somewhat better, 
20 producing 1 8 g/1 of pantothenate in 36 hours, 22 g/1 in 48 hours, and 3 1 g/1 in 72 hours). 
Valine was measured at 0.85 and 1.5 g/1 for strains PA365 and PA377, respectively, at 
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48 hours in SVYG medium. Strain PA377 maintained valine between 1-1 .5 g/1 
throughout most of the fermentation and 6-alanine between-0.2 and 0.5 g/1. 

Strain PA377 was further evaluated in 10-liter fermentors in yeast extract based 
PFMG medium. Pantothenate yields in PFMG and SVYG medium were similar. In 
5 PFMG, PA377 produced 19 gA of pantothenate in 36 hours, 25 g/1 in, 48 hours, and 29 
g/1 in 72 hours. In SVYG, PA377 produced 18 g/L pantothenate in 36 hours, 22 in 
48 hours and 3 1 g/L in 72 hours (Table 20). 

EXAMPLE XI: Converting Strain PA377 to a Tryptophan Prototroph 

10. 

PA377 (Trp") was transformed to Trp + using chromosomal DNA'from PY79 to 
give strain PA824. After re-amplification of the /> 25 /Kw£>casette, PA824 was compared 
to PA377 for pantothenate production in test tube cultures grown in*SVY glucose with 
or without 5 g/L P-alanine (Table 21). 

15 

Table 21 : Trp* derivatives o/PA377: Pantothenate production in 48 hour test tube ' 
cultures grown in SVY glucose, ^-alanine 



Strain 


trpC donor 


OD 6 oo 


[pan] -g/L 


- {3-alanine 


+ p-alanine 


- ^-alanine 


+ p-alanine 


PA377-1 


RL-1 


8 


8 


1.5 


3.4 


PA377-2 


RL-1 


8 


9 


1.6 


3:6 


PA824-1 


PY79 


12 


10 


0.7 


3.7 


PA824-2 


PY79 


11 


11 


1.9 


419 



The Trp+ strains grew to slightly higher densities man PA377. In the absence of 
exogenous p-alanine, all of the strains produced similar levels of pantothenate, while 
with the addition of p-alanine, the Trp+ derivatives produced somewhat more 
pantothenate. 



25 Fermentor studies with PA824 

PA824 was evaluated in CF3000 Chemap 14 liter vessels with 10 liter working 
volumes. Formulations for two of the media used in the fermentors are given in Tables 
22 and 23. 
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Table 22 : Formulation for PFMG-5 medium 



BATCH 





MATERIAL 


g/L (final []) 


1 


Amberex 1003 


10 ! 


2 


Na Glutamate 


5' 


3 


(NH4)2S0 4 


8 








4 


MAZUDF37C 


2.5 








Added After Sterilization and Cool Down 








1 


KH2PO4 




2 


K 2 HP0 4 -3H 2 0 


20 








1 


1 Glucose 


20 


2 


MgCl 2 -6H 2 0 


1 


3 


CaCl 2 -2H 2 0 


0.1 








1 


Sodium Citrate 


1 


2 


FeS0 4 -7H 2 0 


0.01 


3 


SM-1000X 


1.0 ml 










H 2 0 


qs to 6000 ml 


FEED 




MATERIAL 


g/L 


1 


Glucose 


600 


2 


CaCl 2 -2H 2 0 


0.6 










H 2 Q 


qs to 3000 ml 
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Table 23 : Formulation for SVY-4 medium 



BATCH 





MATERIAL 


g/L (final [)) 


1 


Veal Infusion 


25 


2 


Yeast Extract 


5 


3 


Na Glutamate 


5 


4 


(NH4)2S0 4 


. 4 








5 


MAZU DF 37C 


2.5 








Added After Sterilization and Cool Down 








1 


KH 2 P0 4 


10 


2 


K 2 HP0 4 -3H 2 0 


20 








1 


Glucose ' 


20 


2 


MgCl 2 -6H 2 0 


1 


3 


CaCl 2 -2H 2 0 


0.1 1 








1 


Sodium Citrate 


1 


2 


FeS0 4 7H 2 0 


0.01 


3 


SM-1000X 


1.0 ml 










H 2 0 


qs to 6000 ml 


FEED 




MATERIAL 


g/L 


1 


Glucose 


600 


2 


CaCl 2 2H 2 0 


0.6 










H 2 Q 


qs to 3000 ml 



All fermentations were glucose limited fed batch processes. Immediately after 
inoculation, agitation was set at 200 rpm. The initial batched. 2% glucose was consumed 
10 during exponential growth. Afterwards, glucose concentrations were maintained 
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between 0.2 and 1.0 g/L by continuous feeding of a 60% glucose solution. The variable 
rate feed pump was computer controlled and linked to the dissolved oxygen 
concentration [p0 2 ] in the tank by an algorithm. When the [p0 2 ]fell to 30%, computer 
control began to automatically adjust the agitation rate to maintain a dissolved oxy gpn 

5 concentration between 25 and 30% [p0 2 ]. Computer control and data recording were by 
Braun MFCS software. 

In one study, PA284 was grown in fermentors at two temperatures (40°C and 
43°C) in the medium described in Table 22. Results of two experiments demonstrated 
that the highest pantothenate titers at early time points were produced at 43°C. The cell 

10 mass approached 1 50 optical density units at OD^oo and 56 hours at 43°C, and the 

pantothenate titers were 21 g/L, 28 g/L and 36 g/L at 36, 48 and 72 hours respectively. 
In the parallel fermentation at 40°C, the cell mass approached 120 optical density units 
at OD600 and 56 hours, and the pantothenate titers were 18 g/L, 26 g/L and 37 g/L at 36, 
48 and 72 hours, respectively. 

15 In another study, PA824 was grown in a fermentor at 43°C in the medium 

described in Table 23. The cell mass exceeded 160 optical density units at OD600 and 36 
hours, and the pantothenate titers were 23 g/L, 34 g/L, 37 g/L and 40 g/L at 24, 36, 48 
and 60 hours, respectively. Ill other fermentations, increasing the amount of trace 
elements in the glucose feed (e.g., increasing the concentration of SM from IX to 2X) 

20 resulted in even higher titers of pantothenate. 

EXAMPLE XII: Identification and characterization of the B. subtilis coaA gene 
product 

The annotated version of the B. subtilis genome sequence available on the 
25 "Subtilist" web site contains no gene labeled as coaA. However a homology search 
using the protein sequence of E. coli pantothenate kinase as a query sequence gave a 
good match with B. subtilis gene yqjS, which is annotated as "unknown; similar to 
pantothenate kinase." This gene appears to be the penultimate gene in an operon 
containing five open reading frames {Figure 1 8). Two of the open reading frames 
30 encode proteins which are similar to D-serine dehydratase and to "ketoacyl reductase"; 
the other two have no known homologies. For the open reading frame corresponding to 
coaA, there are three possible start codons; each having a possible ribosome-binding site 
(RBS) associated with it The three potential coaA ORFs were named coaAl t coaA2, 
and coaA J, from longest to shortest. 
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All three potential coaA open reading frames were cloned along with their 
respective RBSs by PCR followed by ligation into expression plasmid pAN229. 
pAN229 is a low copy vector in E. coli that provides expression from the SP01 phage 
1 Pjs promoter and can integrate by single crossover at bpr with tetracycline selection. A , 
5 representative resulting plasmid, pAN28 1 , is shown in Figure 1 9. 

To determine if the cloned putative coaA ORFs actually encode a pantothenate 
kinase activity, several isolates of all three plasmids were transformed into the £ coli, 
strain YH1, that contains the coaA15(Ts) allele. Transformants were streaked to plates 
incubated at 30° and 43°C to test for complementation of the temperature sensitive 
10 allele. All isolates of all three coaA variants, except for one isolate of pAN282, 

complemented well at 43°C, indicating that all three plasmid constructs encode an active 
pantothenate kinase. Accordingly, it can be concluded that the B. subtilis yqjS open 
reading frame codes for an active pantothenate kinase. , 

1 5 EXAMPLE XIII: Deletion of the coaA gene from the B. subtilis genome 

The coaA gene of B. subtilis AyqjS) was deleted from the-chromosome of a B. 
subtilis strain by conventional means. The majority of the coaA coding sequence was 
deleted from a plasmid clone and replaced by achloramphenicol resistance .gene (cat), 
while leaving approximately 1 kb of upstream and downstream sequence to allow 

20 homologous recombination within the chromosome, to give plasmid pAN296<see 

Figure 17). pAN296 was then used to transform a B. subtilis strain (PY79), selecting for 
chloramphenicol resistance. The majority of transformants*esult from a double 
crossover event that effectively substitutes the car^ene for the coaA gene. The 
transformed strain containing the coaA deletion -<at insertion grew normally due the 

25 presence of a second B. subtilis pantothenate kinase encoding gene described herein. 

EXAMPLE XTV: Identification and characterization of a second B. subtilis gene 
encoding pantothenate kinase activity 

As described in detail in the instant specification, in order to maximize 
30 pantothenate production, it is necessary to restrict the flow of pantothenate toward 
Coenzyme A (CoA), for example, by reducing the activity of pantothenate kinase, the 
first en2yme in the pathway from pantothenate to Co A. After finding that deletion of 
the coaA gene from the chromosome of B. subtilis is not a lethal event fsee Example 
XIII), it was concluded that B. subtilis must contain a second gene that encodes an active 
35 pantothenate kinase, since pantothenate kinase is anessential enzyme activity. 
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A second pantothenate kinase-encoding gene was identified by complementing 
the E. coli strain YH1 (coaA15(Ts)) with a B. subtilis gene bank and selecting for 
transformants that were able to grow at 43°C. Found among the transformants were two 
families of plasmids that had overlapping restriction maps within each family, but not 
5 between the families. As expected, the restriction map of one family was identical to 
that predicted from the B. subtilis genome sequence for the homologue of the R coli 
coaA gene (which we named coaA also, see above) and surrounding sequences. The 
other family had a restriction map that was completely non-overlapping with the first. 

DNA sequencing of the ends of the cloned inserts from the second family 
1 0 showed that the clones came from a region of the B. subtilis chromosome that includes 
the 3' end of the ftsR gene, the 5' end of the sul gene, and all of the yacB, yacC, yacD, 
cysK, pabB, pabA and pabC genes. None of the open reading frames of these cloned 
inserts showed homology to any known pantothenate kinase sequences, either 
prokaryotic or eukaryotic. 

1 5 Several deletions were created through the B. subtilis genomic sequences in the 

cloned inserts. Each deletion was tested for complementation of the E. coli temperature 
sensitive pantothenate kinase. In particular, a deletion that removed all DNA between a 
Stu I site in the cloning vector and a Swa I site in the yacC gene, leaves yacB as the only 
intact open reading frame in the cloned insert (see Figure 21). This deleted plasmid still 

20 complemented the R coli pantothenate kinase mutant. However, another deletion that 
removed DNA from the Swa I site in yacC through a Bst 1 1 071 site in the (already 
truncated) ftsH gene, could not complement the E. coli pantothenate kinase mutant. 
From these results, it was concluded that ih&yacB open reading frame was responsible 
for the complementation activity. To confirm ihatyacB is a pantothenate kinase gene, 

25 the yacB ORF plus 1 12 base pairs of downstream flanking sequence was amplified by 
PCR in two independent reactions and cloned downstream of a constitutive promote to 
give plasmids pAN341 and pAN342 (Figure 22). Both pAN341 and pAN342 
complemented the defect in YH1 at 44°C, while a control plasmid, which has the same 
backbone, but expresses panBCD instead of yacB did not. This confirmed that the yacB 

30 open reading frame was responsible for the complementation of YH1 . 

As such, a novel gene that encodes pantothenate kinase activity in R subtilis has 
been discovered that is not related by homology to any previously known pantothenate 
kinase gene. This gene has been renamed coaX, as a second, alternative gene that 
encodes an enzyme that catalyzes the first step in the pathway from pantothenate to 
35 CoaA Deletion of coaX by methods described above for deleting coaA> in conjunction 
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with reduction in the activity of *he GoaA enzyme, provides a means to reduce 
pantothenate kinase activity to the desired level. 

Several homologues of the B. subtilis coaX gene were identified by homology 
searching of various publically available databases using the published yacB(coaX) 
5 open reading frame sequence and predicted amino acid "sequence {as set forth in SEQ ID 
NOs:84 and 85 respectively). In two -cases (Mycobacterium tuberculosis and 
Streptomyces coelicolor) the homologous coaX genes are adjacent to, or almost adjacent 
to, pantothenate biosynthetic genes, consistent with these homologs having a role in 
pantothenate metabolism. The CoaX proteins show no homology to the CoaA family of 
1 0 pantothenate kinases, nor to the eukaryotic family of pantothenate kinases exemplified 
by PanK of Saccharomyces cerevisiae. 

Alignment of the amino acid sequences of several bacterial CoaX homologs with 
the amino acid sequence predicted from translating the 5. subtilis yacBVRF described 
in the published B. subtilis genome sequence revealed that the CoaX proteins from other 
1 5 bacteria contained additional amino acid residues at their carboxy-terminal ends. 
Moreover, these extensions beyond the end of the predict amino acid for the R 
subtilis gene product contained two relatively well-conserved segments of sequence. 

Translation of nucleotide sequences just downstream from the-stop codon of the 
B. subtilis yacB ORF in a different reading frame-revealed the existence of amino acid 
20 sequences very similar to the carboxy-terminal extensions of the other bacterial CoaX 
proteins. It is thus believed that an error exists in the published DNA sequence of the B. 
subtilis yacB ORF sequence that causes a frame shift leading to an artifactual 
downstream amino acid sequence and premature termination. 

The PCR-generated sequences of B. subtilis CoaX in pAN341 and pAN342 
25 (described above) contain enough downstream flanking -sequence to encode the putative 
carboxy-terminal extension described above, which is consistent with the result that the 
clones were functional in the complementation assay. However when the 3' PCR 
primer was positioned to include only the shorter yacB ORF predicted from the 
published sequence, but not to include the putative carboxy-terminal extension, then the 

30 resulting plasmids, pAN329 and pAN330 (similar in structure to pAN341 and pAN342; 
see Figure 22), did not complement the defect in YH1 . This result supports the notion 
that the published yacB coding sequence contains a frame-shift error, and that the 
carboxy-terminal end of CoaX is necessary for pantothenate kinase activity. The 
predicted correct nucleotide sequence for B. subtilis coaX is set forth as SEQ ID NO:19 

35 and the translated amino acid sequence is*set forth as SEQ ID NO:9. A multiple 



BNSOOCtD: <WO . _ 0121772A3 IA> 



WO 01/021772 



-95- 



PCT/US00/25993 



sequence alignment of the CoaX amino acid sequences of B. subtilis and 1 1 homologues 
thereof is set forth in Figure ,23. 

EXAMPLE XV: Generation of mutant coaA genes encoding pantothenate kinase 
5 having reduced or temperature sensitive activities 

This Example describes strategies for modifying the coaA gene (i.e. 9 by 
introducing point mutations) to reduce the activity of pantothenate kinase after coaX is 
deleted from the genome. 

10 Cloning and sequencing of the temperature sensitive allele of the E. coli coaA gene. 

Two E. coli strains, each exhibiting a different mutant CoaA phenotype, were 
obtained from the E. coli Genetic Stock Center. Strain DV62 contains the coaA15(Ts) 
allele, and DV79 contains the coaA16(Fr) mutation. DV62 is temperature sensitive at 
43 °C and produces a pantothenate kinase that is temperature sensitive. DV79 was 

1 5 obtained by reversion of DV62 to temperature resistance, and it produces a temperature 
stable, feedback resistant pantothenate kinase activity. Since the DNA sequences of 
these alleles are not available in the literature, the coaA genes from the two mutant 
strains were cloned by PCR and sequenced, in addition to a coaA gene from a strain that 
is wild type at the coaA locus, MM294. The PCR primer at the 5* end was designed to 

20 include the start codon plus four bases upstream, and added an arbitrarily chosen 
ribosome binding site (RBS). The three PCR generated fragments were each ligated 
between the Xbal and BamHI sites of pAN229 to give pAN284 (from coaA15(Ts)\ 
pAN285 (from wild type coaA), and pAN286 (from coaA16(Fr)\ pAN229 is a low 
copy E. coli vector that provides expression from the Pjs promoter and that can integrate 

25 by single crossover at bpr in B. subtilis with tetracycline selection. 

All three plasmids were transformed into the E. coli strain YH1 for 
complementation testing. All three plasmids complemented the temperature sensitive 
coaA mutation in E. coli YH1 . It is presumed that the coaA15(Ts) gene in pAN284 is 
probably significantly overexpressed relative to the normal chromosomal gene, such that 
30 the overproduction compensates for the temperature sensitive defect. Complementation 
of a defect by overproduction is a well-documented phenomenon in E. coli. 

The coaA coding regions from pAN284, 285, and 286 were subcloned into 
pGEM7 to give pAN306, 307, and 308, respectively, for DNA sequencing. As 
expected, the DNA sequence of the insert in pAN307 (from wild type coaA) matched 
35 the coaA sequence from the E. coli genome database (GenBank™). The sequence from 
pAN306 contains a single base change that causes a S176L substitution (JLe. 9 a Ser -> 
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Leu substitution in the amino acid sequence set Forth as SEQ ID NO:2). interestingly, 
the DNA sequence of the pAN308 insert, derived from the feedback resistant strain, was 
identical to that derived from its temperature sensitive parent (represented in pAN306). 
1 This is in accord with the genetic data that indicates that the reversion of the temperature , 

5 sensitive mutation occurred at a second site unlinked to the coaA gene. 

< 

The S176L mutation, predicted tocause the temperature sensitive defect in E. 
coli pantothenate kinase, changed a serine residue that is conserved in all known or 
suspected bacterial coaA encoded pantothenate kinases, including that of 5. subtilis (see 
SEQ ID NO:3 and refer to alignment). Based on this, a serine to leucine change at the 

1 0 homologous residue in the B. subtilis pantothenate kinase is predicted to result in either 
a temperature sensitive enzyme or one which is less active. Accordingly, to produce a 
mutant B. subtilis coaA £ene, this specific change was introduced into the B. subtilis 
coaA gene. The mutant version is installed in the chromosome of a B. subtilis strain 
• deleted for coaX, for example, and the recombinant microorganism is checked for 

1 5 temperature sensitivity (e.g. , reduced growth at 43°C). The mutation is then installed 
into a pantothenate overproducing strain, preferably asfrain deleted for the above 
mentioned coaX gene by standard methods to .give strains favorable for pantothenate 
production in B. subtilis, Le.,a strain that has reduced pantothenate kinase activity under 
typical fermentation conditions. 

20 

Additional coaA point mutations resulting in reduced pantothenate kinase activity 

Of course it is expected that many other point mutations or combinations of more 
than one point mutation in B. subtilis coaA will also lead to reduced activity. 
Appropriate mutations can be generated by mutagenic polymerase chain reaction and in 
25 vitro recombination, and identified by screening for alleles that poorly complement the 
E. coli coaA15(Ts) mutant. An example of such a mutation of this type is a tyrosine to 
histidine substitution at amino acid 181 of B. subtilis coaA, generated by mutagenic 
polymerase chain reaction (see SEQ ID NO:3 and first line of the alignment of Figure 
24). 

30 Isolate pAN282A was derived from the middle-sized B. subtilis coaA open 

reading frame described in Example XII. pAN282A complemented the Exoli 
coaA15(Ts) mutant very poorly, but nonetheless at a level that was detectable above 
background. As was done for the E. coli coaA clones, the open reading frame from 
pAN282A was subcloned into pGEM7 to givepAN303. The DNA sequence of the 

35 insert in pAN303 showed a single base change that led to a tyrosine to histidine amino 
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acid change at the tyrosine corresponding to Yl 81 of SEQ ID NO:3. This tyrosine 
residue is conserved in all bacterial coaA genes/homologues present in GenBank (Figure 
24). This tyrosine residue and the serine that is altered in the E. coli temperature 
sensitive pantothenate kinase described above are separated by only three amino aci£ 

5 residues in a region which is highly conserved in bacterial pantothenate kinases whereas 
the DNA sequence of a second isolate of the middle-sized open reading frame, from 
pAN282B, was identical to the wild type sequence from the 2J. subtilis genome 
sequencing project. The single base change found in pAN303 probably occurred during 
PGR amplification of the coaA gene. If this variant of coaA2 has sufficient residual 

1 0 biological activity in B. subtilis, it may be useful in the future for providing reduced 
pantothenate kinase activity. 

A preferred plasmid that can serve as a basis for mutag^nizing the coaA open 
reading frame is pAN294 (see e.g., Figure 25 and Example XII). Briefly, mutagenic 
PCR is performed using pAN294 as a template and variants of coaA having reduced 

1 5 pantothenate kinase activity are screened as described above. Alternatively, mutations 
such as the one isolated in pAN282A can be installed into pAN294. The desired 
mutation is then introduced into the chromosome of a B. subtilis strain by transformation 
with the appropriate pAN294 derivative and selected for chloramphenicol resistance at 5 
mg/L. Among the resulting transformants will be isolates that contain the desired 

20 mutation. 

In a similar fashion, mutations that reduce the activity of the CoaX enzyme can 
be generated and identified, and such mutations used for optimizing pantothenate 
production by reducing CoA production as described above. 

25 EXAMPLE XVI: Deleting the second pantothenate kinase gene, coaX gene from 
B. subtilis 

With the knowledge gained above concerning the existence and nature of coaX, 
one can create a deletion of the coaA'open reading frame from the B. subtilis 
chromosome that will remove the encoded activity, and that will not adversely affect the 
30 expression of the genes downstream from coaX. In such a deleted strain, the coaA gene 
will be the only gene that encodes pantothenate kinase. 
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To delete the coaX gene from B. subtilis, plasmid pAN336 (SEQ ID NO:92), 
which contains upstream and downstream homology for double crossover, was 
constructed with a kanamycin resistance gene replacing most of the coaXOBf (Figure 
26). Strain PY79 was transformed to kanamycin resistance by pAN336, and an isolate 

5 confirmed to have resulted from a double crossover by PCR was named PA876. As 
predicted, deletion of coaXby itself is not lethal for B. subtilis. Furthermore, 
chromosomal DNA from PA876 would not transform competent PAS61 (PY79 JcoaA 
::cat) to kanamycin resistance. These results indicate that it is the combination of 
AcoaAwcat and AcoaX :: kan that is lethal for B. subtilis, -confirming that 5. subtilis 

10 contains two unlinked genes that encode pantothenate kanase, coaA andvoaX, and that 
either gene alone is capable of supplying sufficient pantothenate kinase for a normal rate 
of growth. 

EXAMPLE XVII: Construction of a plasmid designed to allow directed 
1 5 mutagenesis of the JJ. subtilis coaA gene 

In order to easily introduce mutated coaA genes into the B. subtilis chromosome, 
it was necessary to install an antibiotic resistance gene adjacent to the coa^^ene. This 
was accomplished by joining together in the vector pGEM5 three DNA fragments: (1) a 
3.4 kb DNA sequence containing 2:5 kb of genomic sequence upstream from-voaA and 
20 the coaA open reading frame(s); <2) a 1 . 1 kb DNA sequence containing a 

chloramphenicol resistance gene (cat); and<3) a 1.4 kb DNA sequence comprising a 
region downstream from the operon that contains coaA . The resulting plasmid, named 
pAN294, effectively replaces the open reading frame yqj'T (the open reading frame just 
downstream from coaA) with the cat gene, with enough homology flanking both -sides of 
25 the cat gene to allow double recombination into the B. subtilis chromosome (Figure 25). 
pAN294 was transformed into B. subtilis strain PY79, selecting for chloramphenicol 
resistance at 5 mg/1 to give strains PA836 and PA837, which are presumably identical. 
PA836 and 837 were checked by diagnostic PCR to show that the caf rgene had 
integrated by double crossover, as opposed to single crossover. PA836 and PA837-grow 

30 normally, leading to the conclusion that the open reading frame yqjT is not -essential 
(i.e., ihtyqjT open reading frame could be deleted from strains PA836 and PA837 with 
no significant effect on growth or pantothenate production). Thus, variant alleles<i.e, 
mutations) of the coaA gane can be introduced into pAN294 and the resulting plasmids 
can be used to introduce the variant alleles into the chromosome of, for example, a B. 

35 subtilis strain. 
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EXAMPLE XVHI: Generation of mutant coaX genes encoding pantothenate 
kinase having reduced or temperature sensitive activities 

Mutant coaX genes are generated by introducing point mutations into the gene 
and testing the resulting mutants for the ability to complement the E. coli YH1 strain as 

5 described in Example XII. Preferred mutations in the coaX gene seqeunces are those 
that encode a substitution of a residue conserved among CoaX sequences from a variety 
of bacterial sources (e.g., a conserved residue set forth in Figure 23). Alternatively, 
random mutations in the coaX gene sequence are generated by mutagenic PCR and in 
vitro recombination and identified by screening for alleles that poorly complement the 

10 E. coli coaA15(Ts) mutant 

Mutants so generated (i.e., mutants having reduced coaX activity) can be further 
engineered such that the endogenous coaA gene is deleted (as described in Example 
XIII). CoaX reduced-activity mutants can also be further engineered to contain reduced- 
activity CoaA gene products as described in Example XV. 

15 ' 

EXAMPLE XIX: Enhanced Production of Panto-Compounds Using Bacteria 
Having Deletions in One or More Pantothenate Biosynthetic Enzymes 

If the desired panto-compound is not pantothenate, then an appropriate deletion 

20 of one or more of the pantothenate biosynthetic genes from a pantothenate 

overproducing strain will provide a strain that produces said desired panto-compound. 
In this example, the desired panto-compound is pantoate. Starting with, for example, 
strain PA236, PA3 1 3 or PA824 either one or both of the panC and pariD genes is 
deleted. In another example, ketopantoate is the desired panto-compound. Starting 

25 with, for example strain PA244, PA245 or PA824 one, two or all of the i/vC, panEl, 
panC and pariD genes are deleted from the starting strain. If ^-alanine is the desired 
panto-compound, then panB and panC can be deleted, preferably in a fashion that leaves 
an in frame fusion of a small portion of the 5 5 end of pariB with a small portion of the 
3' end ofpanC, from the strain PA221, PA235, PA245, or PA313. In all of the above- 

30 mentioned examples, the panto-compound producing strain will be a pantothenate 
auxotroph. Accordingly, the growth medium requires sufficient pantothenate for 
adequate growth. Vectors designed to overexpress panD as described above are then 
transformed into the above strains to further enhance P-alanine production. 

The above-mentioned deletions are accomplished by methods well-known to 

35 those skilled in the art, for example, by insertion of an antibiotic resistance gene and 
removing sufficient sequence from the target gene(s) to inactivate said target gene(s). 
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Altemativeiy, removal of targeted sequences is accomplished without simultaneous 
introduction of an antibiotic resistance gene in said target gene and then introduced by 
concession (co-transformation with any other appropriate selectable DNA sequence) 
followed by screening for the loss of function of said target gene by replica plating. 

Table 24 : Strains (and corresponding phenotypes) for panto-compound production 



Name Pheno 


Drug 


panBCD 


panE locus 




type 


resist. 


' locus 




T) AO') 1 


Tm- 

irp- 




P26panBCD 




FA22Z 






P is panBCD 




PA235 






P26panBCD 




PA236 






P 26 panBCD 


P 26 panEl 


PA327 


Trp- 




P26panBCD 


P26panEl 


PA328 


Trp- 




P26panBCD 


P26panEI 


PA340 


Trp- 


Spc 


P26panBCD 


P26panEl 


PA342 


Trp- 


Spc 


P26panBCD 


P26panEJ 


PA354 


Trp- 


Spc 


P26panBCD 


P26panEl 


PA365 


Trp- 


Spc, 


P26panBCD 


P26panEl 






Tet 






PA374 


Trp- 


Spc 


P26panBCD 


P26panEl 


PA377 


Trp- 


Spc, 


P26panBCD 


P26panEl 






Tet 






PA401 


Trp- 




P26panBCD 




PA402 


Trp- 




P26panBCD 




PA403 


Trp- 




P26panBCD 




PA404 


Trp- 


Spc, 


P26panBCD 


P26panEJ 






Tet 






PA405 


Trp- 


Spc, 


P26panBCD 


P26panEJ 






Tet 






PA651 


Trp- 


Spc 


P26panBC*D P26panEl 


PA284 




Spc, 


P26 , panBCL 


> P26panEl 



locus 



amyE locus bpr locus Parent 



RL-1 

PA221 
PA221 
PA235 
PA327 
PA328 
PA342 
PA354 



P26ilvBNC 
P26ilvBNC 



P26ilvBNC 
P26ilvBNC 



P26panD423 

P26panD423 
P26panD428 
P26panD429 
P26panD423 



PA340 
PA374 

PA221 
PA221 
PA221 
PA340 



P26panD423.. PA342 



P26panD423 



PA374 
PA377 



Tet 



Equivalents Those skilled in the art will recognize, or be able to ascertain using no 
10 more than routine experimentation, many equivalents to the specific-embodiments s>f the 
invention described herein. Such equivalents are intended to be encompassed by the 
following claims. 
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What is claimed: 

1 . A method of producing a panto-compound comprising culturing a 
microorganism which overexpresses at least one Bacillus pantothenate biosynthetic 

5 ' enzyme under conditions such that the panto-compound is produced. 

2. The method of claim 1 , wherein the microorganism overexpresses at least 
one Bacillus subtilis pantothenate biosynthetic enzyme. 

10 3. The method of claim 1 or 2, wherein the pantothenate biosynthetic . 

enzyme is selected from the group consisting of ketopantoate hydroxymethyltransferase, 
pantothenate synthetase, aspartate-a-decarboxylase and ketopantoate reductase. 

4. The method of any one of claims 1 to 3, wherein the microorganism 
1 5 , overexpresses at least two pantothenate biosynthetic enzymes. 

5. The method of any one of claims 1 to 3, wherein the microorganism 
overexpresses at least three pantothenate biosynthetic enzymes. 

20 6. The method of any one of claims 1 to 5, wherein the panto-compound is 

selected from the group consisting of pantothenate, pantoate, ketopantoate and {3- 
alanine. 

7. A method of producing a panto-compound comprising culturing a 

25 ketopantoate reductase-overexpressing (KPAR-O) microorganism under conditions such 
that the panto-compound is produced. 

8. The method of claim 7, wherein the panto-compound is pantothenate or 
pantoate. 

30 

9. The method of claim 7 or 8, wherein the ketopantoate reductase is 
bacterial-derived. 

10. The method of claim 7 or 8, wherein the ketopantoate reductase is 
35 derived from Bacillus. 
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1 1 . The method of claim 7 or 8, wherein the ketopantoate reductase is 
derived from Bacillus subtilis. 

12. The method of any one of claims 7 to 1 1 , wherein the KPAR-0 

5 microorganism further overexpresses at least one pantothenate biosynthetic^nzyme in 
addition to overexpressing ketopantoate reductase. 

1 3 . The method of claim 12, wherein the KPAR-O microorganism further 
overexpresses at least one of ketopantoate hydroxymethyltransferase, pantothenate 

1 0 synthetase and aspartate-owiecarboxylase. 

14. A method of producing pantothenate in a mannet independent of 
precursor feed comprising culturing an aspaitate-a-decarboxylase-overexpressing (AaD- 
O) microorganism having a deregulated isoleucine-valine<//v) pathway under conditions 

1 5 such that pantothenate is produced. 

15. A method of producing at least 2 g/L pantothenate in a manner 
independent of aspartate or p-alanine feed comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that 

20 pantothenate is produced. 

1 6. A method of producing at least 2 -g/L pantothenate in a manner 
independent of valine or a-ketoisovalerate feed comprising culturing a microorganism 
having a deregulated isoleucine-valine<*7v) biosynthetic pathway under conditions such 

25 that pantothenate is produced. 

17. A method of producing at least 30 g/L pantothenate in a manner 
independent of aspartate or P-alanine feed comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that 

30 pantothenate is produced. 

18. A method of producing at least 30 g/L pantothenate in a manner 
independent of valine or a-ketoisovalerate feed comprising culturing a microorganism 
having a deregulated isoleucine-valine (i/v) biosynthetic pathway under conditions such 

3 5 that pantothenate is produced. 
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19. A P-alanine independent high yield production method for producing 
pantothenate comprising culturing a manipulated microorganism under conditions such 
that pantothenate is produced at a significantly high yield. 

5 20. The method of any one of claims 1 4 to 1 9, wherein the microorganism 

overexpresses acetohydroxyacid synthetase or is transformed with a yector comprising 
an ilvBN nucleic acid sequence or an alsS sequence. 

2 1 . The method of any one of claims 1 4 to 1 9, wherein the microorganism 
1 0 overexpresses acetohydroxyacid isomeroreductase or is transformed with a vector 

comprising an HvC nucleic acid sequence. 

22. The method of any one of claims 1 4 to 1 9, wherein the microorganism 
overexpresses dihydroxyacid dehydratase or is transformed vyith a vector comprising an 

15 . ilvD nucleic acid sequence. 

23. The method of any one of claims 1 9 to 22, wherein the microorganism 
overexpresses aspartate-a-decarboxylase or is transformed with a vector comprising a 
panD nucleic acid sequence. 

20 

24. The method of any one of claims 1 4 to 23, wherein the microorganism 
further has a deregulated pantothenate biosynthetic pathway. 

25 . The method of any one of claims 1 4 to 24, wherein the microorganism 
25 further has at least one mutant gene selected from the group consisting of a mutant avtA 

gene, a mutant ilvE gene, a mutant ansB gene and a mutant alsD gene. 

26. The method of claim 24, wherein the microorganism overexpresses any 
of ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate 

30 synthetase and aspartate-a-decarboxylase. 

27. The method of claim 24 or 26, wherein the microorganism is transformed 
with a vector comprising a panBCD nucleic acid sequence or a vector comprising a 
panEl nucleic acid sequence. 

35 
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28. The method of any one of claims 14 to 16 and 19 to 27, wherein 
pantothenate is produced at a level selected from the group consisting of a level greater 
than lOg/L, a level greater than 20g/L and a level greater than 40g/L. 

5 29. The method of claim 20, wherein the microorganism overexpresses 

acetohydroxyacid synthetase derived from Bacillus or is transformed with a vector 
comprising an ilvBN nucleic acid sequence or an alsS nucleic acid sequence derived 
from Bacillus. 



1 0 30. The method of claim 21 , wherein the microorganism overexpresses 

acetohydroxyacid isomeroreductase derived from Bacillus or is transformed with a 
vector comprising an ilvC nucleic acid sequence derived from Bacillus. 



3 1 . The method of claim 22, wherein the microorganism overexpresses 
1 5 dihydroxyacid dehydratase derived from Bacillus or is transformed with a vector 

comprising av ilvD nucleic acid sequence derived "from Bacillus. 

32. The method of claim 23, wherein the microorganism overexpresses 
aspartate-a-decarboxylase derived from Bacillus or is transformed with a vector 

20 comprising a panD nucleic acid sequence derived from Bacillus. 

33 . The method of claim 24 or 26, wherein the microorganism over-expresses 
any of ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate 
synthetase and aspartate-a-decarboxylase derived from Bacillus. 

25 

34. The method of claim 27, wherein the vector comprises a panBCD nucleic 
acid sequence or a panEl nucleic acid sequence *ierived from Bacillus. 

35 . A method of producing a panto-compound comprising contacting a 
30 composition comprising at least one pantothenate biosynthesis pathway precursor or 

isoleucine-valine biosynthesis pathway precursor with at least one isolated Bacillus 
enzyme selected from the group consisting of ketopantoate hydroxymethyltransferase, 
ketopantoate reductase, pantothenate synthetase and aspartate-a-decaiboxylase, under 
conditions such that the panto-compound is produced. 

35 
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36. A method of producing P-alanine comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that |3- 
alanine is produced. 

5 37. The method of claim 36, wherein the AaD-O microorganism has a 

mutation in a nucleic acid sequence encoding a pantothenate biosynthetic enzyme 
selected from the group consisting of ketopantoate hydroxymethyltransferase, 
ketopantoate reductase and pantothenate synthetase. 

10 38. A method of producing p-alanine comprising contacting a composition 

comprising aspartate with an isolated Bacillus aspartate-a-decarboxylase enzyme under 
conditions such that P-alanine is produced. 

39. A method for enhancing production of a pant^compound comprising 

1 5 ■ culturing a mutant microorganism having a mutant coaX gene under conditions such that 
the panto-compound production is enhanced. 

40. The method of claim 39, wherein said recombinant microorganism has a . 
mutant coaA gene. 

20 

41 . A method of producing a panto-compound comprising a pantothenate, 
kinase mutant microorganism under conditions such that the panto-compound is 
produced at a significantly high yield. 

25 42. The method of claim 4 1 , wherein said mutant microorganism has a 

mutant coaA gene. 

43 . The method of claim 4 1 , wherein said mutant microorganism has a 
mutant coaX gene. 

30 

44. The method of claim 4 1 , where said mutant microorganism has a mutant 
coaA and coaX gene. 

45. The method of any one of claims 39 to 44, wherein said panto-compound 
35 is selected from the group consisting of ketopantoate, pantoate or pantothenate. 
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46. Hie method of any one of claims 39 to 44, wherein said panto^compound 
is pantothenate. 

47. The method of any one of claims 39 to 44, wherein said panto-compound . 
5 is produced at a level selected from the group consisting of a leveLgreater than lOg/L, a« 

level greater than 20g/L and a level greater than 40g/L. 

48. The method of any one of claims 39 to 44, wherein said recombinant 
microorganism further has a deregulated pantothenate biosynthetic pathway or further 

10 has a deregulated isoleucine-valine {ilv) biosynthetic pathway . 

49. The method of claim any one of claims 39 to 44, wherein said 
recombinant microorganism further overexpresses panD and panE. 

1 5 50. The method of any one of claims 39 to 44, wherein said recombinant 

microorganism further has at least one mutant gene selected from the group-consisting 
of a mutant avtA gene, a mutant t/v£<gene, a mutant ansB gene and a mutant ulsD :gene. . , 

51. A method for enhancing production of a panto-compound comprising 

20 culturing a microorganism that has a deregulated pantothenate biosynthetic pathway and 
that also has a mutation that results in reduced pantothenate kinase activity under 
conditions such that the panto-compound production is enhanced. 

52. A method for identifying compounds which modulate pantothenate 

25 kinase activity comprising contacting a recombinant cell expressing pantothenate kinase 
encoded by the coaX gene with a test compound and determining the ability of the4est 
compound to modulate pantothenate kinase activity in said cell. 

53. The method of claim 52, wherein said cell further comprises a mutant 
30 coaA gene encoding apantothenate kinase having reduced activity. 

54. The method of any one of claims 1 to 5 1 , wherein the microorganism is 
Gram positive. 

35 55. The method of any one of claims 1 to 51, wherein the microorganism is 

Gram negative. 
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56. The method of any one of claims 1 to 5 1 , wherein the microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci and Streptomyces. 

5 57. The method of any one of claims 1 to 5 1 and 54 to 56, wherein the 

microorganism is of the genus Bacillus. 

58. The method of any one of claims 1 to 51 and 54 to 57, wherein the 
microorganism is Bacillus subtilis. 

10 

59. The method of any one of claims 1 to 13, 35, 39 to 5 1 and 54 to 58, 
further comprising recovering the panto-compound. 

60. The method of any one of claims 14 to 34 and 54 to 58, further 
1 5 comprising recovering the pantothenate. 

61 . The method of any one of claims 1 to 14, 35 , 39 to 46, 48 to 51 and 54 to 

. » 

59, wherein the panto-compound is produced at a level greater than 2 g/L. 

20 62. A recombinant microorganism which overexpresses at least one Bacillus 

pantothenate biosynthetic enzyme. 

63 . The recombinant microorganism of claim 62, which overexpresses at 
least one Bacillus subtilis pantothenate biosynthetic enzyme. 

25 

64. The recombinant microorganism of claim 62 or 63, wherein the 
pantothenate biosynthetic enzyme is selected from the group consisting of ketopantoate 
hydroxymethyltransferase, pantothenate synthetase, aspartate-a-decarboxylase and 
ketopantoate reductase. 

30 

65. The recombinant microorganism of any one of claims 62 to 64, wherein 
the pantothenate biosynthetic enzyme is ketopantoate reductase. 

66. A recombinant microorganism which overexpresses aspartate-a- 
35 decarboxylase and has a deregulated isoleucine-valine (/7v) biosynthetic pathway. 
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67. A recombinant microorganism having a mutant coaXgene, said mutant 
coaX gene encoding reduced pantothenate kinase activity in said microorganism. 

68. The recombinant microorganism of claim t>7 further having a mutant 

5 coaA gene, said mutant coaA gene encoding reduced pantothenate 4cinase activity in said 
microorganism. 

69. A recombinant microorganism having a mutant coaX gene and optionally 
having a mutant coaA gene, said mutant microorganism having reduced pantothenate 

1 0 kinase activity as compared to a microorganism having wild-type coaA and -coaX genes. 

70. A recombinant microorganism comprising a vector comprising an 
isolated coaX gene. , 

i 

15 71. A recombinant microorganism that overproduces a panto-compound, the 

microorganism having a deregulated pantothenate biosynthetic pathway and having at 
least one mutation that results in a decrease in the capacity of the microorganism to 
synthesize Coenzyme A (CoA). 

20 72. The recombinant microorganism of claim 71 , having at least one 

mutation that results in a reduced level of pantothenate kinase activity. 

73 . The recombinant microorganism of claim 72, having a mutation in a 
coaA gene, or homologue thereof, that results in a reduced level of CoaA -enzyme 

25 activity. 

74. The recombinant microorganism of-claim 72, having a mutation in a 
coaX gene, or homologue thereof, that results in a reduced level of CoaX^enzyme 
activity. 

30 

75. The recombinant microorganism of claim 72, having a mutation in a 
coaA gene, or homologue thereof, and having a mutation in &coaX gene, or homologue 
thereof, the mutations resulting in reduced levels of CoaA enzyme activity and reduced 
CoaX enzyme activity. 

35 

76. The recombinant microorganism of any one of claims 66 to 70 which 
further has a deregulated pantothenate biosynthetic pathway. 
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77. The recombinant microorganism of any one of claims 62 to 65 and 67 to 
75, further having a deregulated isoleucine-valine (ilv) biosynthetic pathway. 

5 78. The recombinant microorganism of any one of claims 62 to 77, which is 

Gram positive. 

79. The recombinant microorganism of claim 78 belonging to a genus 
selected from the group consisting of Bacillus, Conryebacterium, Lactobacillus, 

1 0 Lactococci and Streptomyces. 

80. The recombinant microorganism of claim 79 belonging to the genus 
Bacillus. 

15 81. The recombinant microorganism of claim 80 which is Bacillus subtilis. 

82. A recombinant microorganism selected from the group consisting of 
PA221, PA235, PA236, PA313, PA410, PA402, PA403, PA41 1, PA412, PA413, 
PA303, PA327, PA328, PA401, PA340, PA342, PA404, PA405, PA374, PA354, 

20 PA365, PA377, PA651 and PA824. 

83 . A recombinant vector for use in the production of panto-compounds 
comprising a nucleic acid sequence which encodes at least one Bacillus pantothenate 
biosynthetic enzyme operably linked to regulatory sequences. 

84. The vector of claim 83, comprising a nucleic acid sequence which 
encodes at least one Bacillus subtilis pantothenate biosynthetic enzyme. 

85. The vector of claim 84, wherein the nucleic acid sequence encodes at 
30 least one of ketopantoate hydroxymethyltransferase, pantothenate synthetase, aspartate- 

a-decarboxylase and ketopantoate reductase. 

86. A recombinant vector comprising at least one nucleic acid sequence 
selected from the group consisting of SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 

35 SEQ ID NO:29 and SEQ ID NO:59. 
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87. The vector of claim 84, wherein the nucleic acid sequence encodes 
ketopantoate reductase. 

88. A vector comprising a mutant coaXgene, -said mutant encoding a 
5 pantothenate kinase enzyme having reduced activity. 

89. A vector comprising an isolated ^oaA'gene. 

90. A vector comprising an isolated Bacillus cooA^gene. 

10 

91. A vector comprising an isolated Bacillus subtilis<oaX%&Nt. 

92. The vector of any one of claims 86 and 89 to pi , which further comprises 
, regulatory sequences. 

15 

93. The vector of any one of claims 83 to 85, 87 and 92, wherein the 
regulatory sequences comprise a constitutively active promoter. 

94. The vector of claim 93, wherein the constitutively active promoter 
20 comprises P veg (SEQ ID NO:41), Pis(SEQ ID NO:39) or P 26 (SEQ ID NO:40) 

sequences. 

95. The vector of claim 83, wherein the regulatory sequences comprise at 
least one artificial ribosome binding site (RBS). 

25 

96. The vector of claim 95, wherein the artificial RBS comprises a nucleotide 
sequence selected from the group consisting of SEQ ID NO:49, SEQ,IDNO:50, SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55,SEQ ID 
NO:56andSEQ IDNO:57. 

30 

97. A vector selected from the group consisting of pAN004, pANG05, 
pAN006, pAN236, pAN423, pAN428, pAN429, pAN441, pAN442, pAN443, pAN251, 
pAN267, pAN256, pAN257, pAN263, pAN240, pAN294, pAN296, pAN336, pAN341 
andpAN342. 

35 

98 . A recombinant microorganism comprising the vector of claim 86 or 93 . 



8NSOOCia <WO 0121772A3. IA> 



WO 01/021772 PCMJSOO/25993 

t 

-111- 

99. An isolated nucleic acid molecule which encodes at least one Bacillus 
pantothenate biosynthetic gene. 

100. The isolated nucleic acid molecule of claim 99 which encodes at least 
5 one Bacillus subtilis pantothenate biosynthetic gene. 

101. The isolated nucleic acid molecule of claim 99 or 1 00 which encodes 
ketopantoate reductase. 

10 1 02. An isolated Bacillus pantothenate biosynthetic enzyme polypeptide. 

1 03 . An isolated Bacillus subtilis pantothenate biosynthetic enzyme 
polypeptide. 

1 5 104. An isolated Bacillus ketopantoate reductase polypeptide. 

105. An isolated Bacillus subtilis ketopantoate reductase polypeptides 

« 

1 06. An isolated Bacillus aspartate-a-decarboxylase polypeptide. 

20 

1 07. An isolated Bacillus subtilis aspartate-a-decarboxylase polypeptide. 

108. An isolated nucleic acid molecule comprising a mutant coaX gene. 
25 1 09. An isolated nucleic acid molecule comprising a coaX gene. 

110. An isolated pantothenate kinase protein encoded by a coaX gene. 
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SEQUENCE LISTING 

<110> OMNIGENE BIOPRODUCTS 

<120> METHODS AND MICROORGANISMS FOR PRODUCTION OF 
PANTO-COMPOUNDS 

<130> BGI-141CPPC 

<140> 
<141> 

<150> USSN 09/400,494 
<151> 1999-09-21 

<150> USSN 60/210,072 
<151> 2000-06-07 

<150> USSN 60/221,836 
<151> 2000-07-28 

<150> USSN 60/221,836 • 
<151> 2000-08-24 

<160> 94 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 311 
<212> PRT 

<213> Haemophilus influenzae 
<400> 1 

Met Glu Phe Ser Thr Gin Gin Thr Pro Phe Leu Ser Phe Asn Arg Glu 
1 5 10 15 

Gin Trp Ala Glu Leu Arg Lys Ser Val Pro Leu Lys Leu Thr Glu Gin 
20 25 30 

Asp Leu Lys Pro Leu Leu Gly Phe Asn Glu Asp Leu Ser Leu Asp Glu 
35 40 45 

Val Ser Thr lie Tyr Leu Pro Leu Thr Arg Leu lie Asn Tyr Tyr lie 
50 55 60 

Asp Glu Asn Leu His Arg Gin Thr Val Leu His Arg Phe Leu Gly Arg 
65 70 75 80 

Asn Asn Ala Lys Thr Pro Tyr lie lie Ser He Ala Gly Ser Val Ala 
85 90 95 

Val Gly Lys Ser Thr Ser Ala Arg He Leu Gin Ser Leu Leu Ser His 
100 105 110 

Trp Pro Thr Glu Arg Lys Val Asp Leu He Thr Thr Asp Gly Phe Leu 
115 120 125 



WO 01/021772 



PCT/US00/25993 



-2- 

Tyr Pro Leu Asn Lys Leu Lys Gin Asp Asn Leu Leu Gin Lys Lys Gly 
130 135 140 

Phe Pro Val Ser Tyr Asp Thr Pro Lys Leu lie Arg Phe Leu Ala Asp 
,145 150 155 160 

Val Lys Ser Gly Lys Ser Asn Val Thr Ala Pro lie Tyr Ser His Leu 
165 170 ' 175 

Thr Tyr Asp lie He Pro Asp Lys Phe Asp Val Val Asp Lys Pro Asp 
180 185 190 

He Leu He Leu Glu Gly Leu Asn Val Leu. Gin Thr Gly Asn Asn Lys 
195 200 205 

Thr Asp Gin Thr Phe Val Ser Asp Phe Val Asp Phe Ser He Tyr Val 
.210 215 220 

Asp Ala Glu Glu Lys Leu Leu Lys Glu Trp Tyr He Lys Arg Phe Leu 
225 230 " 235 240 

• Lys Phe Arg Glu Ser Ala Phe Asn Asp Pro Asn Ser Tyr Phe Lys His 
245 " 250 255 

Tyr Ala Ser Leu Ser Lys Glu Glu Ala He Ala Thr Ala Ser Lys He 
260 265 270 

Trp Asp Glu He Asn Gly Leu Asn Leu Asn Gin Asn He Leu -Pro Thr 
275 280 285 

Arg Glu Arg Ala Asn Leu lie Leu Lys Lys Gly His Asn His Gin Val 
290 295 300 

Glu Leu He Lys Leu Arg Lys 
305 310 



<210> 2 
<211> 316 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Met Ser He Lys Glu Gin Thr Leu Met Thr Pro Tyr Leu Gin Phe Asp 
1 5 10 15 

Arg Asn Gin Trp Ala Ala Leu Arg Asp Ser Val Pro Met Thr Leu Ser 
20 25 30 

Glu Asp Glu He Ala Arg Leu Lys Gly lie Asn Glu Asp Leu Ser Leu 
35 40 45 

Glu Glu Val Ala Glu He Tyr Leu Pro Leu Ser Arg Leu Leu Asn Phe 
50 55 60 

Tyr He Ser Ser Asn lieu Arg Arg Gin Ala Val Leu Glu Gin Phe Leu 
65 70 75 SO 
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Gly Thr Asn Gly Gin Arg He Pro Tyr He He Ser He Ala Gly Ser 
85 90 95 

Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gin Ala Leu Leu 
100 , 105 no 

Ser Arg Trp Pro Glu His Arg Arg Val Glu Leu He Thr Thr Asp Gly 
H5 120 125 

Phe Leu His Pro Asn Gin Val Leu Lys <Slu Arg Gly Leu Met Lys Lys 
130 135 140 

Lys Gly Phe Pro Glu Ser Tyr Asp Met His Arg Leu Val Lys Phe Val 
i45 150 155 160 

Ser Asp Leu Lys Ser Gly Val Pro Asn Val Thr Ala Pro Val Tyr Ser 
1^5 170 175 

His Leu He Tyr Asp Val He Pro Asp Gly Asp Lys Thr Val Val Gin 
180 185 190 

Pro Asp He Leu He Leu Glu Gly Leu Asn Val Leu Gin Ser Gly Met 
195 200 205 

Asp Tyr Pro His Asp Pro His His Val Phe Val Ser Asp Phe Val Asp 
210 215 220 

Phe Ser He Tyr Val Asp Ala Pro Glu Asp Leu Leu Gin Thr Trp Tvr 
225 230 235 240 

He Asn Arg Phe Leu Lys Phe Arg Glu Gly Ala Phe Thr Asp Pro Asp 
245 250 255 

Ser Tyr Phe His Asn Tyr Ala Lys Leu Thr Lys Glu Glu Ala He Lys 
260 265 270 

Thr Ala Met Thr Leu Trp Lys Glu He Asn Trp Leu Asn Leu Lys Gin 
275 280 285 

Asn lie Leu Pro Thr Arg Glu Arg Ala Ser Leu He Leu Thr Lys Ser 
290 295 300 

Ala Asn His Ala Val Glu Glu Val Arg Leu Arg Lys 
305 310 315 



<210> 3 
<211> 319 
<212> PRT 

<213> Bacillus subtilis 
<400> 3 

Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gin His Asn 
15 10 ~ 15 



Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser He Ala Val Ser 
20 25 30 
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Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp Tyr Leu Ser Val 
35 40 45 

Glu Glu Val Glu Thr lie Tyr lie Pro Leu Val Arg Leu Leu His Leu 
, 50 55 60 

His Val Lys Ser Ala Ala Glu Arg Asn Lys His Val Asn Val Phe Leu 
65 70 75 • 80 

Lys His Pro His Ser Ala Lys lie Pro Phe lie lie Gly lie Ala Gly 
85 90 95 

Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg lie Leu Gin Lys Leu 
100 105 110 

Leu Ser Arg Leu Pro Asp Arg Pro Lys Val Sei Leu lie Thr Thr Asp 
115 120 125 

Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys Lys Lys Asn Met Met Ser 
130 135 140 . 

, Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe 
145 150 155 160 

Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr 
165 170 175 

Ser His Leu Thr Tyr Asp Arg Glu Glu Gly Val Phe Glu Val Val Glu 
180 185 190 

Gin Ala Asp He Val He He -Glu Gly He Asn Val Leu Gin Ser Pro 
195 200 205 

Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg He Phe Val Ser Asp Phe 

210 215 220 ■ • 

Phe Asp Phe Ser He Tyr Val Asp Ala Glu Glu Ser Arg He Phe Thr 
225 230 235 240 

' Trp Tyr Leu Glu Arg Phe Arg *Leu Leu Arg Glu Thr Ala Phe Gin Asn 
245 250 255 

Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala 
260 265 270 

Asp Glu Met Ala Ala Ser He Trp Glu Ser Val Asn Arg Pro Asn Leu 
275 280 285 

Tyr Glu Asn He Leu Pro Thr Lys Phe! Arg Ser Asp Leu He Leu Arg 
290 295 300 



Lys Gly Asp Gly His Lys Val Glu Glu Val Leu Val Arg Arg Val 
305 310 315 



<210> 4 
<211> 312 
<212> PRT 
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<213> Mycobacterium leprae 
<400> 4 

Met Pro Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp- Arg Lys 
1,5 10 15 

Gin Trp Arg Ala Leu Arg Met Ser Thr Pro Leu Ala Leu Thr Glu Glu 
20 25 30 

Glu Leu He Gly Leu Arg Gly Leu Gly Glu Gin He Asp Leu Leu Glu 
35 40 45 

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu He His Leu Gin Val 
50 55 60 

Ala Ala Arg Gin Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu 
65 70 75 80 

Pro Gin Gin Asn Pro Gly Arg Pro Val Pro Phe He lie Gly Val Ala 
85 90 95 

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gin Ala 
100 105 110 

Leu Leu Ala Arg Trp Asp His His Thr Arg Val Asp Leu Val Thr Thr 
115 120 125 

Asp Gly Phe Leu Tyr Pro' Asn Ala Glu Leu Gly Arg Arg Asn Leu Met 
130 135 140 

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg 
145 150 155 160 

Phe Val Thr Ser Val Lys Ser Gly Ala Asp Tyr Ala Cys Ala Pro Val 
165 170 175 

Tyr Ser His Leu Arg Tyr Asp Thr He Pro Gly Ala Lys His Val Val 
180 185 190 

Arg His Pro Asp He Leu He Leu Glu Gly Leu Asn Val Leu Gin Thr 
195 200 205 

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val 
210 . 215 220 

Asp Ala Arg He Gin Asp He Glu Gin Trp Tyr Val Ser Arg Phe Leu 
225 230 235 240 

Ala Met Arg Gly Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His 
245 250 255 

Tyr Ser Ala Leu Thr Asp Ser Lys Ala He He Ala Ala Arg Glu He 
260 265 270 

Trp Arg Ser He Asn Arg Pro Asn Leu Val Glu Asn He Leu Pro Thr 
275 280 285 

Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser lie 
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290 295 300 

Asn Arg Leu Arg Leu Arg Lys Leu 
305 • 310 



<210> 5 
<211> 312 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 5 

Met Ser Arg Leu Ser Glu Pro Ser Pro Tyr Val <31u Phe Asp Arg Arg 
1 5 10 15 

Gin Trp Arg Ala Leu Arg Met Ser Thr Pro Leu Ala Leu Thr Glu <31u 
20 25 30 

i 

Glu Leu Val Gly Leu Arg Gly Leu <Sly Glu Gin He Asp Leu Leu Glu 
35 40 45 

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu He His Leu Gin Val 
50 ~ 55 60 

Ala Ala Arg Gin Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu 
65 70 ( 75 80 

Pro Gin Gin Asn Pro Asp Arg Pro Val Pro Phe He He Gly Val Ala 
85 90 95 

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gin Ala 
100 105 110 

Leu Leu Ala Arg Trp Asp His His Pro Arg Val Asp Leu Val Thr Thr 
115 120 125 

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gin Arg Arg Asn Leu Met 
130 135 140 

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg 
145 150 155 160 

Phe Val Thr Ser Val Lys Ser -Gly Ser Asp Tyr Ala <:ys Ala Pro Val 
165 170 175 

Tyr Ser His Leu His Tyr Asp He He Pro <31y Ala -Glu Gin Val Val 
180 185 190 

Arg His Pro Asp He Leu He Leu Glu Gly Leu Asn Val Leu Gin Thr 
195 200 205 

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val 
210 215 *220 

Asp Ala Arg He Glu Asp He Glu Gin Trp Tyr Val Ser Arg Phe Leu 
225 230 235 240 

Ala Met Arg Thr Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His 
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245 250 255 

Tyr Ala Ala Phe Ser Asp Ser Gin Ala Val Val Ala Ala Arg Glu He 
260 265 270 

Trp Arg Thr He Asn Arg Pro Asn Leu Val Glu Asn He Leu Pro Thr 
' 275 280 285 

Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser He 
290 295 300 

Asn Arg Leu Arg Leu Arg Lys Leu 
305 310 



<210> 6 
<211> 329 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 6 

Met He Ser Pro Val Pro Ser He Pro Arg Ser Ala His Arg Gin Arg 
1 5 10 15 

Pro Glu Ala Thr Pro Tyr Val Asp Leu Thr Arg Pro Glu Trp Ser Ala 
20 25 "* 30 

Leu Arg Asp Lys Thr Pro Leu Pro Leu Thr Ala Glu Glu Val Glu Lys 
35 40 45 

Leu Arg Gly Leu Gly Asp Val He Asp Leu Asp Glu Val Arg Asp He 
50 55 60 

Tyr Leu Pro Leu Ser Arg Leu Leu Asn Leu Tyr Val Gly Ala Thr Asp 
65 70 75 80 

Gly Leu Arg Gly Ala Leu Asn Thr Phe Leu Gly Glu Gin Gly Ser Gin 
85 90 95 

Ser Gly Thr Pro Phe Val He Gly Val Ala Gly Ser Val Ala Val Gly 
100 105 " 110 

Lys Ser Thr Val Ala Arg Leu Leu Gin Ala Leu Leu Ser Arg Trp Pro 
115 120 125 

Glu His Pro Arg Val Glu Leu Val Thr Thr Asp Gly Phe Leu Leu Pro 
130 135 ~ 140 

Thr Arg Glu Leu Glu Ala Arg Gly Leu Met Ser Arg Lys Gly Phe Pro 
145 150 155 ~ 160 

Glu Ser Tyr Asp Arg Arg Ala Leu Thr Arg Phe Val Ala Asp He Lys 
165 170 175 

Ala Gly Lys Ala Glu Val Thr Ala Pro Val Tyr Ser His Leu He Tyr 
180 185 190 

Asp He Val Pro Asp Gin Arg Leu Val Val Arg Arg Pro Asp He Leu 
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195 200 205 

lie Val Glu Gly Leu Asn Val Leu Gin Pro Ala. Leu Pro Gly Lys Asp 
210 , 215 220 

Gly Arg Thr Arg Val Gly Leu Ala Asp Tyr Phe Asp Phe Ser Val Tyr 
225 230 , 235 240 

Val Asp Ala Arg Thr Glu Asp lie Glu Arg Trp Tyr Leu Asn Arg Phe 
245 250 255 

Arg Lys Leu Arg Ala Thr Ala Phe Gin Asn Pro Ser Ser Tyr Phe Arg 
260 265 270 

Lys Tyr Thr Gin Val Ser Glu Glu Glu Ala Leu Asp Tyr Ala Arg Thr 
275 280 285 

Thr Trp Arg Thr He Asn Lys Pro Asn Leu Val Glu Asry Val Ala Pro 
290 295 : 300 

Thr Arg Gly Arg Ala Thr Leu Val Leu Arg Lys Gly Pro Asp His Lys 
305 ~ " 310 315 . 320 

Val Gin Arg Leu Ser Leu Arg Lys Leu 
325 



<210> 7 
<211> 265 
<212> PRT 

<213> Streptomyces coeli-color 
<400> 7 

Met Leu Leu Thr He Asp Val Gly Asn Thr His Thr 'Val Leu Gly Leu . 
1 5 10 15 

Phe Asp Gly Glu Asp He Val Glu His Trp Arg He Ser Thr Asp ISer 
20 25 30 

Arg Arg Thr Ala Asp Glu Leu Ala Val Leu Leu Gin Gly Leu Met Gly 
35 40 45 

Met His Pro Leu Leu Gly Asp Glu Leu Gly Asp . Gly He Asp Gly He 
50 55 60 

Ala He Cys Ala Thr Val Pro Ser Val Leu His Glu Leu Arg Glu Val 
65 70 75 80 

Thr Arg Arg Tyr Tyr Gly Asp Val Pro Ala Val Leu Val Glu Pro Gly 
85 90 95 

Val Lys Thr Gly Val Pro He Leu Thr Asp His Pro Lys Glu Val -Gly 
100 105 HO 

Ala Asp Arg He He Asn Ala Val Ala Ala Val Glu Leu Tyr Gly Gly 
115 120 125 

Pro Ala He Val Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Ala Val 
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130 



135 



140 



Ser Ala Arg Gly Glu Tyr lie Gly Gly Val He Ala Pro Gly He Glu 
145 150 155 ^ 160 

, He Ser Val Glu Ala Leu Gly Val Lys Gly Ala Gin Leu Arg Lys He 
165 170 175 

Glu Val Ala Arg Pro Arg Ser Val He Gly Lys Asn Thr Val, Glu Ala 
180 185 * 190 

Met Gin Ser Gly He Val Tyr Gly Phe Ala Gly Gin Val Asp Gly Val 
195 200 205 

Val Asn Arg Met Ala Arg Glu Leu Ala Asp Asp Pro Asp Asp Val Thr 
210 215 ' 220 

Val He Ala Thr Gly Gly Leu Ala Pro Met Val Leu Gly Glu Ser Ser 
225 230 235 240 

Val He Asp Glu His Glu Pro Trp Leu Thr Leu Met Gly Leu Arg Leu 
245 250 » 255 

Val Tyr Glu Arg Asn Val Ser Arg Met 



<210> 8 
<211> 272 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 8 

Met Leu Leu Ala He Asp Val Arg Asn Thr His Thr Val Val Gly Leu 
1 5 10 15 

Leu Ser Gly Met Lys Glu His Ala Lys Val Val Gin Gin Trp Arg He 
20 25 30 

Arg Thr Glu Ser Glu Val Thr Ala Asp Glu Leu Ala Leu Thr He Asp 
35 40 45 

Gly Leu He Gly Glu Asp Ser Glu Arg Leu Thr Gly Thr Ala Ala Leu 
50 55 60 

Ser Thr Val Pro Ser Val Leu His Glu Val Arg He Met Leu Asp Gin 
65 70 75 80 

Tyr Trp Pro Ser Val Pro His Val Leu He Glu Pro Gly Val Arg Thr 
85 90 ^ 95 

Gly He Pro Leu Leu Val Asp Asn Pro Lys Glu Val Gly Ala Asp Arg 
100 105 HO 

He Val Asn Cys Leu Ala Ala Tyr Asp Arg Phe Arg Lys Ala Ala He 
115 120 ~ 125 

Val Val Asp Phe Gly Ser Ser He Cys Val Asp Val Val Ser Ala Lys 



260 



265 
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130 



135 



140 



Gly Glu Phe Leu Gly Gly Ala lie Ala Pro Gly Val Gin Val Ser Ser 
145 ' 150 155 160 

Asp Ala Ala Ala Ala Arg Ser Ala Ala Leu Arg Arg Val Glu Leu Ala 
165 170 175 

Arg Pro Arg Ser Val Val Gly Lys Asn Thr Val ^Glu Cys Met Gin Ala 
180 , 185 190 

Gly Ala Val Phe Gly Phe Ala Gly Leu Val Asp Gly Leu Val Gly Arg 
195 200 205 

He Arg Glu Asp Val Ser Gly Phe Ser Val Asp His Asp Val Ala He 
210 215 220 

Val Ala Thr Gly His Thr Ala Pro Leu Leu Leu Pro Glu» Leu His Thr 
225 230 235 240 

Val Asp His Tyr Asp Gin His Leu Thr Leu -Gin Gly Leu Arg Leu Val 
245 250 255 

Phe Glu Arg Asn Leu Glu Val Gin Arg Gly Arg Leu Lys Thr Ala Arg 
260 265 270 



<210> 9 
<211> 258 
<212> PRT 

<213> Bacillus subtilis 
<400> 9 

Leu Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
1 5 10 15 

Tyr His Asp Gly Lys Leu Glu Tyr His T-rp Arg lie Glu Thr Ser Arg 
20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met lie Leu Arg Ser Leu £he Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin He Asp Gly He lie He Ser Ser 
50 55 60 

Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Gys Thr Lys Tyr 
65 70 75 80 

Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 
100 105 HO 

Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 
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Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 

Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 
165 170 , 175 

Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 

Trp Gin Ala Lys Gin Asp Leu Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 220 

Pro Leu He Ala Asn Glu Ser Asp Cys He Asp He Val Asp Pro Phe 
225 230 235 * 240 

i * 

Leu Thr Leu Lys Gly Leu Glu Leu He Tyr Glu Arg Asn Arg Val Gly 
245 250 255 

Ser Val 



<210> 10 
<211> 262 
<212> PRT 

<213> Deinococcus radiopugnans 
<400> 10 

Met Pro Ala Phe Pro Leu Leu Ala Val Asp He Gly Asn Thr Thr Thr 
1 5 10 15 

Val Leu Gly Leu Ala Asp Ala Ser Gly Ala Leu Thr His Thr Trp Arg 
20 25 30 

He Arg Thr Asn Arg Glu Met Leu Pro Asp Asp Leu Ala Leu Gin Leu 
35 40 45 

His Gly Leu Phe Thr Leu Ala Gly Ala Pro He Pro Arg Ala Ala Val 
50 55 60 

Leu Ser Ser Val Ala Pro Pro Val Gly Glu Asn Tyr Ala Leu Ala Leu 
65 70 75 80 

Lys Arg His Phe Met He Asp Ala Phe Ala Val Ser Ala Glu Asn Leu 
85 90 95 

Pro Asp Val Thr Val Glu Leu Asp Thr Pro Gly Ser Val Gly Ala Asp 
100 105 * HO 

Arg Leu Cys Asn Leu Phe Gly Ala Glu Lys Tyr Leu Gly Gly Leu Asp 
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115 • 120 125 

Tyr Ala Val Val Va,l Asp Phe <51y Thr Ser Thr Asn Phe Asp Val Val 
130 135 140 

Gly Arg Gly Arg Arg Phe Leu Gly Gly He Leu Ala Thr Gly Ala *Gln 
145 * 150 155 160 

Val Ser Ala Asp Ala Leu Phe Ala Arg Ala Ala Lys Leu Pro Arg He 
165 170 ' 175 

Thr Leu Gin Ala Pro* Glu Thr Ala He Gly Lys Asn Thr Val His Ala 
180 185 190 

Leu Gin Ser Gly Leu Val Phe -Gly Tyr Ala -Glu Met Val Asp Gly Leu 
195 200 205 

t 

Leu Arg Arg He Arg Ala Glu Leu Pro Gly -Glu Ala Val Ala Val Ala 
210 215 220 

Thr Gly Gly Phe Ser Arg Thr Val -Gin Gly He -Cys Gin ^31u He Asp 
225 230 235 240 

Tyr Tyr Asp Glu Thr Leu Thr Leu Arg Gly Leu Val Glu Leu Trp Ala 
245 250 255 

Ser Arg Ser Glu Val Arg 
260 



<210> 11 
<211> 212 
<212> PRT 

<213> Desulfovibrio vulgaris 
<400> 11 

Met Thr Gin His Phe Leu Leu Phe Asp He Gly Asn Thr Asn Val Lys 
1 5 10 15 

He Gly He Ala Val Glu Thr Ala Val Leu Thr Ser Tyr Val Leu Pro 
20 25 30 

Thr Asp Pro Gly Gin Thr Thr Asp Ser He Gly l»eu Arg Leu Leu Glu 
35 40 45 

Val Leu Arg His Ala Gly Leu Gly Pro Ala Asp Val Gly Ala -Cys Val 
50 55 €0 

Ala Ser Ser Val Val Pro Gly Val Asn Pro Leu He Arg Arg Ala Cys 
65 70 75 80 

Glu Arg Tyr Leu Tyr Arg Lys Leu Leu Phe Ala Pro Gly Asp He Ala 
85 ' 90 95 

He Pro Leu Asp Asn Arg Tyr Glu Arg Pro Ala Glu Val Gly Ala Asp 
100 105 lit) 

Arg Leu Val Ala Ala Tyr Ala Ala Arg Arg Leu Tyr Pro -Gly Pro Arg 
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115 



120 



125 



Ser Leu Val Ser Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Cys Val 
130 135 140 

, Glu Gly Gly Ala Tyr Leu Gly Gly Leu He Cys Pro Gly Val Leu Ser 
145 150 155 160 

Ser Ala Gly Ala Leu Ser Ser Arg Thr Ala Lys Leu Pro Ar§ He Ser 
165 170 175 

Leu Glu Val Glu Glu Asp Ser Pro Val He Gly Arg Ser Thr Thr Thr 
180 185 ~ 190 

Ser Leu Asn His Gly Phe lie Phe Gly Phe Ala Ala Met Thr Glu Gly 
195 200 205 



Val Leu Ala Ala 
210 



<210> 12 ' 
<211> 246 
<212> PRT 

<213> Thermotoga maritima 
<400> 12 

Met Tyr Leu Leu Val Asp Val Gly Asn Thr His Ser Val Phe Ser He 
1 5 10 15 

Thr Glu Asp Gly Lys Thr Phe Arg Arg Trp Arg Leu Ser Thr Gly Val 
20 25 30 

Phe Gin Thr Glu Asp Glu Leu Phe Ser His Leu His Pro Leu Leu Gly 
35 40 45 

Asp Ala Met Arg Glu He Lys Gly He Gly Val Ala Ser Val Val Pro 
50 55 60 

Thr Gin Asn Thr Val He Glu Arg Phe Ser Gin Lys Tyr Phe His He 
65 70 75 " 80 

Ser Pro lie Trp Val Lys Ala Lys Asn Gly Cys Val Lys Trp Asn Val 
85 90 " 95 

Lys Asn Pro Ser Glu Val Gly Ala Asp Arg Val Ala Asn Val Val Ala 
100 105 HO 

Phe Val Lys Glu Tyr Gly Lys Asn Gly He He He Asp Met Gly Thr 
115 120 125 

Ala Thr Thr Val Asp Leu Val Val Asn Gly Ser Tyr Glu Gly Gly Ala 
130 ( 135 140 

He Leu Pro Gly Phe Phe Met Met Val His Ser Leu Phe Arg Gly Thr 
I 45 150 155 160 

Ala Lys Leu Pro Leu Val Glu Val Lys Pro Ala Asp Phe Val Val Gly 
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1*5 170 175 

Lys Asp Thr Glu Glu Asn lie Arg Leu Gly Val Val Asn *Gly "Ser Val 
180 185 190 

Tyr Ala Leu Glu <31y He He Gly Arg He Lys Glu Val Tyr -Gly Asp 
195 200 205 

Leu Pro Val Val Leu Thr Gly Gly Gin Ser Lys He Val Lys Asp Met 
210 215 220 

He Lys His Glu lie Phe Asp t31u Asp Leu Thr lie Lys Gly Val Tyr 
225 230 235 240 

His Phe Cys Phe Gly Asp 
245 



<210> 13 
<211> 273 
<212> PRT 

<213> Treponema pallidum 
<400> 13 

Met Leu Leu He Asp Val Gly Asn Ser His Val Val Phe <31y He *Gln 
1 5 10 15 

Gly Glu Asn Gly Gly Arg Val Cys Val Arg Glu Leu Phe Arg Leu Ala 
20 25 30 

Pro Asp Ala Arg Lys Thr Gin Asp Glu Tyr Ser Leu Leu He His Ala 
35 40 45 

Leu Cys Glu Arg Ala Gly Val Gly Arg Ala -Ser Leu Arg Asp Ala Phe 
50 55 60 

He Ser Ser Val Val Pro Val Leu Thr Lys Thr He Ala Asp Ala Val 
65 70 75 80 

Ala Gin He Ser Gly Val Gin Pro Val Val Phe -Gly Pro Trp Ala Tyr 
85 90 95 

Glu His Leu Pro Val Arg He Pro *Glu Pro Val Arg Ala Glu He Gly 
100 105 HO 

Thr Asp Leu Val Ala Asn Ala Val Ala Ala Tyr Val His Phe Arg Ser 
115 120 125 

Ala Cys Val Val Val Asp Cys Gly Thr Ala Leu Thr Phe Thr Ala Val 
130 135 140 

Asp Gly Thr Gly Leu He Gin Gly Val Ala He Ala Pro <31y Leu Arg 
145 ~ 150 155 160 

Thr Ala Val Gin Ser Leu His Thr -Gly Thr Ala Gin Leu Pro Leu Val 
165 170 175 

Pro Leu Ala Leu Pro Asp Ser Val Leu Gly Lys Asp Thr Thr His Ala 



WO 01/021772 



PCT/US00/25993 



-15- 



180 



185 



190 



Val 



Gin Ala Gly 
195 



Val 



Val 



Arg 



Gly Thr 
200 



Leu Phe Val lie Arg 
205 



Ala Met 



' He Ala Gin Cys Gin Lys Glu Leu Gly Cys Arg Cys Ala Ala Val He 
210 215 220 

Thr Gly Gly Leu Ser Arg Leu Phe Ser Ser Glu Val Asp Phe Pro Pro 
225 230 235 ^ 240 

He Asp Ala Gin Leu Thr Leu Ser Gly Leu Ala His He Ala Arg Leu 
245 250 255 

Val Pro Thr Ser Leu Leu Pro Pro Ala Thr Val Ser Gly Ser Ser Gly 
260 265 270 

Asn 



<210> 14 1 
<211> 262 
<212> PRT 

<213> Borrelia burgdorferi 
<400> 14 

Met Asn Lys Pro -Leu Leu Ser Glu Leu He lie Asp lie Gly Asn Thr 
1 5 10 15 

Ser He Ala Phe Ala Leu Phe Lys Asp Asn Gin Val Asn Leu Phe He 
20 25 30 

Lys Met Lys Thr Asn Leu Met Leu Arg Tyr Asp Glu Val Tyr Ser Phe 
35 40 * 45 

Phe Glu Glu Asn Phe Asp Phe Asn Val Asn Lys Val Phe He Ser Ser 
50 55 60 

Val Val Pro He Leu Asn Glu Thr Phe Lys Asn Val He Phe Ser Phe 
65 70 75 80 

Phe Lys He Lys Pro Leu Phe He Gly Phe Asp Leu Asn Tyr Asp Leu 
85 90 " 95 

Thr Phe Asn Pro Tyr Lys Ser Asp Lys Phe Leu Leu Gly Ser Asp Val 
100 105 HO 

Phe Ala Asn Leu Val Ala Ala He Glu Asn Tyr Ser Phe Glu Asn Val 
115 120 125 

Leu Val Val Asp Leu Gly Thr Ala Cys Thr He Phe Ala Val Ser Arg 
130 135 140 

Gin Asp Gly He Leu Gly Gly lie He Asn Ser Gly Pro Leu He Asn 
145 150 155 160 

Phe Asn Ser Leu Leu Asp Asn Ala Tyr Leu He Lys Lys Phe Pro lie 
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165 170 175 

Ser Thr Pro Asn Asn Leu Leu *Glu Arg Thr Thr Ser Gly Ser Val Asn 
180 185 190 

Ser Gly Leu Phe Tyr Gin Tyr Lys Tyr Leu lie Glu Gly Val Tyr Arg 
195 200 205 

Asp He Lys Gin Met Tyr Lys Lys Lys Phe Asn Leu He He Thr ^Gly 
210 215 220 

Gly Asn Ala Asp Leu He Leu Ser Leu lie Glu lie *Glu Phe He Phe 
225 230 235 240 

Asn He His Leu Thr Val Glu Gly Val Arg lie Leu Gly Asn Ser Il« 
245 250 255 

Asp Phe Lys Phe Val Asn 
260 



<210> 15 
<211> 229 
<212> PRT 

<213> Aquifex aeolicus 
<400> 15 

Met Arg Phe Leu Thr Val Asp Val Gly Asn Ser Ser Val Asp He Ala 
15 10 15 

Leu Trp Glu Gly Lys Lys Val Lys Asp Phe Leu Lys Leu Ser His 'Glu , 
20 "* 25 30 

Glu Phe Leu Lys Glu Glu Phe Pro Lys Leu Lys Ala Leu -Gly He Ser . 
35 40 45 

Val Lys Gin Ser Phe Ser Glu Lys Val Arg Gly Lys He Pro Lys lie 
50 55 60 

Lys Phe Leu Lys Lys Glu Asn Phe Pro He <31n Val Asp Tyr Lys Thr 
65 70 75 B0 

Pro Glu Thr Leu Gly Thr Asp Arg Val Ala Leu Ala Tyr Ser Ala Xys 
85 90 95 

Lys Phe Tyr Gly Lys Asn Val Val Val He Ser Ala Gly Thr Ala Leu 
100 105 110 

Val He Asp Leu Val Leu Glu Gly Lys Phe Lys Gly Gly Phe He Thr 
115 120 125 

Leu Gly Leu Gly Lys Lys Leu Lys He Leu Ser Asp Leu Ala Glu Gly 
130 " 135 140 

He Pro Glu Phe Phe Pro Glu Glu Val -Glu He Phe Leu Gly Arg Ser 
145 150 155 160 

Thr Arg Glu Cys Val Leu Gly Gly Ala Tyr Arg Glu Ser Thr Glu Phe 
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165 170 175 

lie Lys Ser Thr Leu Lys Leu Trp Arg Lys Val Phe Lys Arg Lys Phe 
180 185 190- 

Lys Val Val He Thr Gly Gly Glu Gly Lys Tyr Phe Ser Lys Phe Gly 
195 200 205 

He Tyr Asp Pro Leu Leu Val His Arg Gly Met Arg Asn" Leu Leu Tyr 
210 215 220 

Leu Tyr His Arg He 
225 



<210> 16 

<211> 257 

<212> PRT 

<213> Synechocystis sp. 

<400> 16 

Met Glu Thr Ser Lys Pro Gly Cys Gly Leu Ala Leu Asp Asn Asp Lys 
1 5 io is 

Gin Lys Pro Trp Leu Gly Leu Met He Gly Asn Ser Arg Leu His Trp 
20 25 30 

Ala Tyr Cys Ser Gly Asn' Ala Pro Leu Gin Thr Trp Val Thr Asp Tyr 
35 40 45 

Asn Pro Lys Ser Ala Gin Leu Pro Val Leu Leu Gly Lys Val Pro Leu 
50 55 60 

Met Leu Ala Ser Val Val Pro Glu Gin Thr Glu Val Trp Arg Val Tyr 
65 70 75 80 

Gin Pro Lys He Leu Thr Leu Lys Asn Leu Pro Leu Val Asn Leu Tyr 
85 90 95 

Pro Ser Phe Gly He Asp Arg Ala Leu Ala Gly Leu Gly Thr Gly Leu 
100 105 HO 

Thr Tyr Gly Phe Pro Cys Leu Val Val Asp Gly Gly Thr Ala Leu Thr 
115 120 125 

He Thr Gly Phe Asp Gin Asp Lys Lys Leu Val Gly Gly Ala He Leu 
130 135 140 

Pro Gly Leu Gly Leu Gin Leu Ala Thr Leu Gly Asp Arg Leu Ala Ala 
i45 150 155 " " 160 

Leu Pro Lys Leu Glu Met Asp Gin Leu Thr Glu Leu Pro Asp Arg Trp 
165 170 175 

Ala Leu Asp Thr Pro Ser Ala He Phe Ser Gly Val Val Tyr Gly Val 
180 185 - 190 

Leu Gly Ala Leu Gin Ser Tyr Leu Gin Asp Trp Gin Lys Leu Phe Pro 
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195 



200 



205 



Gly Ala Ala Met Val He Thr -Gly Gly Asp <31y Lys He Leu His *<31y 

, 210 215 220 

Phe Leu Lys Glu His Ser Pro Asn Leu Ser Val Ala Trp Asp Asp Asn 
225 * 230 235 , 240 

Leu He Phe Leu Gly Met Ala Ala He His His Gly Asp Arg Pro He 
245 250 255 



Cys 



<210> 17 
<211> 223 
<212> PRT 

<213> Helicobacter pylori 
<400> 17 

Met Pro Ala Arg Gin Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys 
1 5 10 15 

Asp He Gly Asn Thr Arg He His Phe Ala Gin Asn Tyr Gin Leu Phe 

20 1 ' ' ,25 * 30 

Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu Gly He Gin Lys Glu He 
35 40 45 

Phe Tyr He Ser Val Asn Glu Glu Asn Glu Lys Ala Leu Leu Asn Cys 
50 55 60 

Tyr Pro Asn Ala Lys Asn He Ala Gly Phe Phe His Leu Glu Thr Asp 
65 70 75 80 

Tyr Val Gly Leu Gly He Asp Arg Gin Met Ala Cys Leu Ala Val Asn 
85 90 95 

Asn Gly Val Val Val Asp Ala Gly Ser Ala He -Thr lie Asp Leu lie 
100 ' 105 HO 

Lys Glu Gly Lys His Leu Gly Gly Cys He Leu Pro Gly Leu Ala -Gin 
115 120 125 

Tyr He His Ala Tyr Lys Lys Ser Ala Lys He Leu Glu -Gin Pro -Phe 
130 135 140 

Lys Ala Leu Asp Ser Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala 
145 150 155 160 

Val Asn Tyr Gly Met Val Leu Ser Val He Ala Cys lie -Gin His Leu 
165 170 175 

Ala Lys Asn Gin Lys He Tyr Leu Cys Gly Gly Asp Ala Lys Tyr Leu 
180 185 190 

Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp 
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195 



200 



205 



Gly Met Glu He Ala Leu Lys Lys Ala Gly He Leu Glu Cys Lys 



<210> 18 
<211> 267 
<212> PRT 

<213> Bordetella pertussis 
<400> 18 

Met He He Leu He Asp Ser Gly Asn Ser Arg Leu Lys Val Gly Trp 
1 5 10 ~ 15 

Phe Asp Pro Asp Ala Pro Gin Ala Ala Arg Glu Pro Ala Pro Val Ala 
20 25 30 

Phe Asp Asn Leu Asp Leu Asp Ala Leu Gly Arg Trp Leu Ala Thr Leu 
35 40 ^ ^ 

Pro Arg Arg Pro Gin Arg Ala Leu Gly Val Asn Val Ala Gly Leu Ala 
50 55 60 

Arg Gly Glu Ala He Ala Ala Thr Leu Arg Ala Gly Gly Cys Asp He 
65 70 75 80 

Arg Trp Leu Arg Ala Gin Pro Leu Ala Met Gly Leu Arg Asn Gly Tyr 
85 90 95 

Arg Asn Pro Asp Gin Leu Gly Ala Asp Arg Trp Ala Cys Met Val Gly 
100 105 " 110 

Val Leu Ala Arg Gin Pro Ser Val His Pro Pro Leu Leu Val Ala Ser 
115 120 125 

Phe Gly Thr Ala Thr Thr Leu Asp Thr He Gly Pro Asp Asn Val Phe 
130 135 140 

Pro Gly Gly Leu He Leu Pro Gly £ro Ala Met Met Arg Gly Ala Leu 
145 150 155 160 

Ala Tyr Gly Thr Ala His Leu Pro Leu Ala Asp Gly Leu Val Ala Asp 
165 170 175 

Tyr Pro He Asp Thr His Gin Ala He Ala Ser Gly He Ala Ala Ala 
180 185 190 

Gin Ala Gly Ala He Val Arg Gin Trp Leu Ala <31y Arg Gin Arg Tyr 
195 200 205 

Gly Gin Ala Pro Glu He Tyr Val Ala Gly Gly Gly Trp Pro Glu Val 
210 215 220 

Arg Gin Glu Ala Glu Arg Leu Leu Ala Val Thr Gly Ala Ala Phe Gly 
225 230 235 240 

Ala Thr Pro Gin Pro Thr Tyr Leu Asp Ser Pro Val Leu Asp Gly Leu 



210 



215 



220 
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245 250 255 



Ala Ala Leu Ala Ala Gin Gly Ala Pro Thr Ala 
260 265 



<210> 19 
<211> 777 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . • (774) 



<400> 19 

ttg tta ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 4*8 
Leu Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 



1 



5 10 15 



' tat cat gat gga aaa tta gaa tat cac tgg <:gt ata gaa aca age -agg 96 
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 
20 25 * 30 

' cat aaa aca gaa gat gag ttt ggg atg att ttg <=gc tec tta ttt gat 14 4 
His Lys Thr Glu Asp Glu Phe Gly* Met He Leu Arg Ser Leu Phe Asp 



35 40 45 

cac tec ggg ctt atg ttt gaa cag ata gat ggc att att att teg tea 
His Ser Gly Leu Met Phe Glu Gin lie Asp Gly He He He Ser Ser 
50 55 60 

gta gtg ccg cca ate atg ttt gcg tta ^gaa aga atg tgc aca aaa tac 
Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 • 70 75 80 

ttt cat ate gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 
Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga ate gta 
Asn He Lys Tyr Asp Asn Pro Lys -Glu Val Gly Ala Asp Arg He Val 
100 105 HO 

aat get gtc get gcg ata cac ttg tac ggc aat cca tta att gtt gtc 
Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 . 125 

gat ttc gga acc gec aca acg tac tgc tat att gat - gaa aac aaa caa 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr lie Asp Glu Asn Lys Gin 
130 135 140 

tac atg ggc ggg gcg att gee cct ggg att aca att teg aca gag gcg 
Tvr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

ctt tac teg egt gca gca aag ctt oct cgt ate gaa ate acc egg cec 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 



192 



240 



268 



336 



384 



432 



480 



528 



t 
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165 170 175 

gac aat att ate gga aaa aac act gtt age gcg atg caa tct gga att 576 

Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 

180 ■ 185 190 

tta ttt ggc tat gtc ggc caa gtg gaa gga ate gtt aag cga atg aaa 624 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 

195 200 205 

tgg cag gca aaa cag gac etc aag gtc att gcg aca gga ggc ctg gcg 672 

Trp Gin Ala Lys Gin Asp Leu Lys Val He Ala Thr Gly Gly Leu Ala 

210 215 220 

ccg etc att gcg aac gaa tea gat tgt ata gac ate gtt gat cca ttc 720 

Pro Leu He Ala Asn Glu Ser Asp Cys He Asp He Val Asp Pro Phe 

225 230 235 * 240 

tta acc eta aaa ggg ctg gaa ttg att tat gaa aga aac cgc gta gga 768 

Leu Thr Leu Lys Gly Leu Glu Leu He Tyr Glu Arg Asn Arg Val Gly 

245 250 " 255 

agt gta tag 777 
Ser Val 



<210> 20 
<211> 960 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (957) 

<400> 20 

gtg aaa aat aaa gaa ctt aac eta cat act tta tat aca cag cac aat 48 
Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gin His Asn 
1 5 ,10 15 

egg gag tct tgg tet ggt ttt ggg ggg cat ttg teg att get gta tct 96 
Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser He Ala Val Ser 
20 25 30 

gaa gaa gag gca aaa get gtg gaa gga ttg aat gat tat eta tct gtt 144 
Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp Tyr Leu Ser Val 
35 40 45 

gaa gaa gtg gag acg ate tat att ccg ctt gtt cgc ttg ctt cat tta 192 
Glu Glu Val Glu Thr He Tyr He Pro Leu Val Arg Leu Leu His Leu 
50 55 60 

cat gtc aag tct gcg get gaa cgc aat aag cat gtc aat gtt ttt ttg 240 
His Val Lys Ser Ala Ala Glu Arg Asn Lys His Val Asn Val Phe Leu 
65 70 75 80 



aag cac cca cat tea gee aaa att ccg ttt att ate ggc att gee ggc 
Lys His Pro His Ser Ala Lys He Pro Phe He He Gly He Ala Gly 



288 
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85 90 95 

agt gtc gca gtc gga aaa age acg acg gcg egg ate ttg cag aag ctg 336 
,Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg lie Leu 'Gin Lys Leu 
100 105 110 

ett teg cgt ttg cct gae cgt cca aaa gtg age ctt ate acg . aca -gat 
Leu Ser Arg Leu Pro Asp Arg Pro Lys Val Ser Leu lie Thr Thr Asp 
115 120 125 

ggt ttt tta ttt cct act gec gag ctg aaa aag aaa aat ate. atg tea 
Gly Phe Leu Phe Pro Thr Ala <Glu Leu Lys Lys Lys Asn Met Met Ser 
130 135 140 

aga aaa gga ttt cct gaa age tat gat gta aag gcg ctg etc gaa ttt 
Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu l*eu Glu Phe 
145 ~ 150 155 160 

ttg aat gac tta aaa tea gga aag gae age gta aag gee ccg -gtg tat 
Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr 
165 170 175 



cag gcg gat att gtg att att gaa ggc att aat gtt ctt cag teg ccc 
Gin Ala Asp He Val He He Glu Gly He Asn Val Leu -Gin Ser Pro 
195 200 205 



aag gga gac ggg cat aag gtc gag gaa gtg ttg gta agg agg gta tga 
Lys Gly Asp Gly His Lys Val -Glu Glu Val Leu Val Arg Arg Val 
305 * 310 315 



384 



432 



480 



528 



tec cat eta acc tat gac cgc gag gaa ggt gtg ttc gag gtt gta gaa 576 
Ser His Leu Thr Tyr Asp Arg <51u Glu Gly Val Phe Glu Val Val Glu 
180 ^ 185 190 



624 



acc ttg gag gat gac egg gaa aac .ccg egt att ttt gtt tec gat ttc 672 
Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg He Phe Val Ser Asp Phe 
210 ~ 215 220 

ttt gat ttt teg att tat gtg gat gcg gag gaa age egg att ttc act 720 
Phe Asp Phe Ser He Tyr Val Asp Ala Glu Glu Ser Arg He Phe Thr 
225 230 235 240 

tgg tat tta gag cgt ttt cgc -ctg ctt egg gaa aca get ttt -caa aat 
Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn 
245 250 255 

cct gat tea tat ttt eat aaa ttt aaa gac ttg tec gat cag gag get 
Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp Gin 'Glu Ala 
260 265 270 

gac gag atg gca gee teg att tgg gag agt gtc aac -egg ccg aat tta 864 
Asp Glu Met Ala Ala Ser He Trp -Glu Ser Val Asn Arg Pro Asn Leu 
275 • 280 285 

tat gaa aat att ttg cca act aaa ttc agg tea gat etc att ttg cgt 912 
Tyr Glu Asn He Leu Pro Thr Lys Phe Arg -Ser Asp Leu He Leu Arg 
290 295 300 



768 



816 



960 
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<210> 21 
<211> 882 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (879) 

<400> 21 

ttg teg att get gta* tct gaa gaa gag gca aaa get gtg gaa gga ttg 48 

Met Ser He Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu 

1 5 10 15 

aat gat tat eta tct gtt gaa gaa gtg gag acg ate tat att ccg ctt 96 
Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr He Tyr He Pro Leu 
20 25 30 

i 

gtt cgc ttg ctt cat tta cat gtc aag tct gcg get gaa cgc aat aag 144 
Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys 
35 40 45 

cat gtc aat gtt ttt ttg aag cac cca cat tea gee aaa att ccg ttt 192 
His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys He Pro Phe 
50 55 60 

att ate ggc att gee ggc agt gtc gca gtc gga aaa age acg acg gcg 240' 
He He Gly He Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala 
65 70 75 80 

egg ate ttg cag aag ctg ctt teg cgt ttg cct gac cgt cca aaa gtg 288 
Arg He Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val 
85 90 95 

age ctt ate acg aca gat ggt ttt tta ttt cct act gee gag ctg aaa 336 
Ser Leu He Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys 
100 105 110 

aag aaa aat atg atg tea aga aaa gga ttt cct gaa age tat gat gta 384 
Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val 
115 120 125 

aag gcg ctg etc gaa ttt ttg aat gac tta aaa tea gga aag gac age 432 
Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser 
130 135 140 

gta aag gee ccg gtg tat tec cat eta ace tat gac cgc gag gaa ggt 480 
Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly 
145 150 155 160 

gtg ttc gag gtt gta gaa cag gcg gat att gtg att att gaa ggc att 528 
Val Phe Glu Val Val Glu Gin Ala Asp He Val He He Glu Gly He 
1*5 170 175 



aat gtt ctt cag teg ccc acc ttg gag gat gac egg gaa aac ccg cgt 
Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg 
180 185 190 



576 
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att ttt gtt tec gat ttc ttt eat ttt teg att tat gtg gat gcg -gag 
lie Phe Val Ser Asp Phe Phe Asp Phe Ser He Tyr Val Asp Ala Glu 
195 200 205 

gaa age egg att ttc act tgg tat tta gag cgt ttt cgc ctg ctt egg 
Glu Ser Arg He Phe Thr Trp Tyr Leu -Glu Arg Phe Arg Leu Leu Arg 
210 215 220 

gaa aca get ttt caa aat cct gat tea tat ttt cat aaa ttt aaa gac 
Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp 
225 230 235 240 

ttg tec gat cag gag get gac gag atg gca gee teg att tgg gag agt 
Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser lie Trp Glu Ser 
245 • 250 255 

gtc aac egg ccg aat tta tat gaa aat att ttg cca act aaa ttc agg 
Val Asn Arg Pro Asn Leu Tyr Glu Asn He Leu Pro Thr Lys Phe Arg 
260 265 270 

tea gat etc att ttg cgt aag gga gac egg cat aag gtc gag gaa gbg 
Ser Asp Leu He Leu Arg Lys Gly Asp *Gly His Lys Val Glu Glu Val 
275 280 285 



€24 



672 



720, 



768 



816 



864 



■ ' ' 8fi2 

ttg gta agg agg gta tga 
Leu Val Arg Arg Val 
290 



<210> 22 
<211> 846 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (843) 

<400> 22 

gtg gaa gga ttg aat gat tat eta tet -gtt gaa gaa gtg gag acg ate 4S 

Met Glu Gly Leu Asn Asp Tyr Leu Ser Val Glu *Glu Val Glu Thr He 
! 5 10 1* 



tat att ccg ctt gtt cgc ttg ctt eat tta eat gte aag tet gcg get 
Tyr He Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala 
20 25 30 

gaa cgc aat aag cat gte aat gtt ttt ttg aag cac cca cat tea gee 
Glu Arg Asn Lys His Val Asn Val Phe Leu Lys His Pro His *Ser Ala 
35 .40 45 

aaa att ecg ttt att ate ggc att gee gge agt gte gca gte gga aaa 
Lys He Pro Phe He He Gly He Ala Gly Ser Val Ala Val Gly Lys 
50 55 «0 

age aeg acg gcg egg ate ttg cag aag cte. ett teg egt ttg ect gac 
Ser Thr Thr Ala Arg He Leu Gin Lys Leu Leu Ser Are. Leu Pro Asp 



96 



144 



192 



240 
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65 70 75 80 ' 

cgt cca aaa gtg age ctt ate acg aca gat ggt ttt tta ttt cct act 288 
Arg Pro Lys Val Ser Leu He Thr Thr Asp Gly Phe Leu Phe ' Pro Thr 
85 90 " 95 

gec gag ctg aaa aag aaa aat atg atg tea aga aaa gga ttt cct gaa 336 
Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu 
100 105 ' 110 

age tat gat gta aag gcg ctg etc gaa ttt ttg aat gac tta aaa tea 384 
Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser 
115 120 125 

gga aag gac age gta aag gee ccg gtg tat tec cat eta ace tat gac 432 
Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp 
130 135 140 

cgc gag gaa ggt gtg ttc gag gtt gta gaa cag gcg ga,t att gtg att 480 
Arg Glu Glu Gly Val Phe Glu Val Val Glu Gin Ala Asp He Val He 
145 150 155 160 

att gaa ggc att aat gtt ctt cag teg ccc ace ttg gag gat gac egg 528 
He Glu Gly He Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg 
165 170 " 175 

gaa aac ccg cgt att ttt gtt tec gat ttc ttt gat ttt teg att tat 576 
Glu Asn Pro Arg He Phe Val Ser Asp Phe Phe Asp Phe Ser He Tyr 
180 185 190 

gtg gat gcg gag gaa age egg att ttc act tgg tat tta gag cgt ttt 624 
Val Asp Ala Glu Glu Ser Arg He Phe Thr Trp Tyr Leu Glu Arg Phe 
195 200 J 205 

cgc ctg ctt egg gaa aca get ttt caa aat cct gat tea tat ttt cat 672 
Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His 
210 215 220 

aaa ttt aaa gac ttg tec gat cag gag get gac gag atg gca gec teg 720 
Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser 
225 230 235 240 

att tgg gag agt gtc aac egg ccg aat tta tat gaa aat att ttg cca 768 
He Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn He Leu Pro 
245 250 255 

act aaa ttc agg tea gat etc att ttg cgt aag gga gac ggg cat aag 816 
Thr Lys Phe Arg Ser Asp Leu He Leu Arg Lys Gly Asp Gly His Lys 
260 265 270 

gtc gag gaa gtg ttg gta agg agg gta tga 846 
Val Glu Glu Val Leu Val Arg Arg Val 
275 280 



<210> 23 
<211> 831 
<212> DNA 
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<213> Bacillus subtilis * 

<220> 

<221> CDS 

<222> (1) . . (831) 

<400> 23 

atg aaa aca aaa ctg gat ttt eta aaa atg aag gag tct gaa gaa ccg 

Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu 'Glu Pro 

15' 10 15 

att gtc atg ctg acc get tat gat tat ccg gca get aaa ctt get gaa 
lie Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 
20 25 30 



gtc etc ggc ctt gat tea act gtc ggt gtg aca gtt geg gac atg ate 
Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met He 
50 55 60 . 



ctt gag ggc gga gaa ggc gtg ttt gaa tec att cge gca ttg acg ctt 
Leu Glu Gly Gly Glu <31y Val Phe <Slu Ser lie Arg Ala Leu Thr Leu 
115 "* 120 125 

gga ggc att cca gta gtc agt cae tta ggt ttg aca ccg cag tea gtc 
<Sly Gly He Pro Val Val Ser His Leu <31y Leu Thr Pro Gin Ser Val 
130 135 140 

ggc gta ctg ggc ggc tat aaa gta .cag ggc aaa gac gaa eaa age gee 
Gly Val Leu Gly Gly Tyr Lys Val Gin -Gly Lys Asp Glu Gin Ser Ala 
145 150 155 160 



atg ctt gtg ctg gaa tgt gtg ecg gca gaa cte aca gee aaa att gee 
Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys He Ala 
180 185 190 



48 



96 



caa gcg gga gtt gac atg att tta gtc ggt gat tea ett gga atg gtc 144 
Gin Ala Gly Val Asp Met He Leu Val Gly Asp Ser Leu»Gly Met Val 
35 40 45 



192 



cat cat aca aaa gec gtt aaa agg ggt ^fcg ccg aat ace ttt att gtg 240 
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe He Val 
65 70 ' ' 75 80 

aca gat atg ccg ttt atg tct tat cae ctg tct aag gaa gat acg ctg 
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys <Glu Asp Thr Leu 
85 90 95 



288 



aaa aat gca gcg get ate gtt cag gaa age gga get gac gca ctg aag 336 
Lys Asn Ala Ala Ala He Val Gin Glu Ser 'Gly Ala Asp Ala Leu Lys 
100 105 HO 



384 



432 



480 



aaa aaa tta ata gaa gac agt ata aaa .tgc gaa gaa gca gga -get atg 528 
Lys Lys Leu He Glu Asp Ser He Lys Cys <31u -Glu Ala <51y Ala Met 
165 170 175 



576 



gag acg eta age ata ccg gtc att -gga ate ggg get ggt gtg aaa gcg -624 
Glu Thr Leu Ser lie Pro Val He "Gly He Gly Ala <31y Val Lys Ala 
195 200 205 
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gac gga caa gtt etc gtt tat cat gat att ate ggc cac ggt gtt gag 672 
Asp Gly Gin Val Leu Val Tyr His Asp He He Gly His Gly Val Glu 
210 215 220 

aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa acc ate 720 

Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg He Asp Glu Thr He 
225 230 235 ' 240 

t 

gaa aca gca ate age gga tat gtt cag gat gta aga cat cgt get ttc 768 

Glu Thr Ala He Ser Gly Tyr Val Gin Asp Val Arg His Arg Ala Phe 

245 250 255 

cct gaa Caa aag cat tec ttt caa atg aac cag aca gtg ctt gac ggc 816 
Pro Glu Gin Lys His Ser Phe Gin Met Asn Gin Thr Val Leu Asp Gly 
260 265 270 

ttg tac ggg gga aaa 
Leu Tyr Gly Gly Lys 
275 



<210> 24 
<211> 277 
<212> PRT 

<213> Bacillus subtilis 
<400> 24 

Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu <Slu Pro 
1 5 10 is 

He Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 
20 25 ~ 30 

Gin Ala Gly Val Asp Met He Leu Val Gly Asp Ser Leu Gly Met Val 
35 40 45 

Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met He 
50 55 60 

His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe He Val 
65 ™ 75 80 

Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 
85 90 95 

Lys Asn Ala Ala Ala He Val Gin Glu Ser Gly Ala Asp Ala Leu Lys 
100 105 no 

Leu Glu Gly <31y Glu Gly Val Phe Glu Ser He Arg Ala Leu Thr Leu 
115 120 125 

Gly Gly He Pro Val Val Ser His Leu Gly Leu Thr Pro <31n Ser Val 
130 135 140 

Gly Val Leu Gly Gly Tyr Lys Val Gin Gly Lys Asp Glu Gin Ser Ala 
145 150 155 160 
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Lys Lys Leu He Glu Asp Ser He Lys Cys Glu Glu Ala <31y Ala Met 
165 170 175 

Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys lie Ala 
180 185 190 

Glu Thr Leu Ser He Pro Val He <31y He Gly Ala Gly Val Lys Ala 
195 200 205 

Asp Gly Gin Val Leu Val Tyr His Asp He He Gly His Gly Val Glu 
210 215 220 

Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg He Asp Glu Thr He 
225 ~ 230 235 240 

Glu Thr Ala He Ser Gly Tyr ^Val Gin Asp Val Arg His Arg Ala Phe 
245 250 255 

Pro Glu Gin Lys His Ser Phe Gin Met Asn -Gin Thr Val Leu Asp Gly 
260 265 270 

Leu Tyr Gly Gly Lys 
~ 275 



<210> 25 
<211> 858 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (858) 

<400> 25 

atg aga cag att act gat att tea cag ctg aaa gaa gec ata aaa caa 

Met Arg Gin He Thr Asp He Ser Gin Leu Lys Glu Ala He Lys Gin 

1 5 10 15 

tac cat tea gag ggc aag tea ate gga ttt gtt -ccg acg atg ggg ttt 
Tyr His Ser Glu Gly Lys Ser He Gly Phe Val Pro Thr Met -Gly Phe 
20 25 30 



48 



96 



ctg cat gag ggg cat tta ace tta gca gac aaa gca aga =caa -gaa aac 144 
Leu His Glu Gly His Leu Thr Leu Ala Asp Lys Ala Arg Gin Glu Asn 
35 40 45 

gac gee gtt att atg agt att ttt gtg . aat <:ct gca caa ttc ggc cct 
Asp Ala Val He Met Ser He Phe Val Asn Pro Ala -Gin Phe Gly Pro 
50 55 €0 

aat gaa gat ttt gaa gca tat ccg cgc gat att gag egg gat gca get 
Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp He Glu Arg Asp Ala Ala 
65 * 70 75 80 

ctt gca gaa aac gec gga gtc gat att ctt ttt acg cca gat get cat 
Leu Ala Glu Asn Ala Gly Val Asp He Leu Phe Thr Pro Asp Ala His 
85 90 95 



192 



240 



288 
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gat atg tat ccc ggt gaa aag aat gtc acg att cat gta gaa aga cgc 336 

Asp Met Tyr Pro Gly Glu Lys Asn Val Thr He His Val Glu Arg Arg 

100 105 110 

aca gac gtg tta tgc ggg cgc tea aga gaa gga cat ttt gac ggg gtc 384 

Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val 

115 120 " " 125 

gcg ate gta ctg acg aag ctt ttc aat eta gtc aag ccg act cgt gee 432 

Ala He Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala 

130 135 140 

tat ttc ggt tta aaa gat gcg cag cag gta get gtt gtt gat ggg tta 480 

Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val Ala Val Val Asp Gly Leu 

145 150 155 160 

ate age gac ttc ttc atg gat att gaa ttg gtt cct gtc gat acg gtc 528 

He Ser Asp Phe Phe Met Asp He Glu Leu Val Pro Val Asp Thr Val 

165 170 175 

aga gag gaa gac ggc tta gee aaa age tct cgc aat £ta tac tta aca 57 6 

Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr 

180 185 190 

get gag gaa aga aaa gaa gcg cct aag ctg tat egg gee ctt caa aca 624 

Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gin Thr 

195 200 205 

agt gcg gaa ctt gtc caa gec ggt gaa aga gat cct gaa gcg gtg ata 672 
Ser Ala Glu Leu Val Gin Ala Gly Glu Arg Asp Pro Glu Ala Val He 

210 215 220 

aaa get gca aaa gat ate att gaa acg act age gga acc ata gac tat 720 

Lys Ala Ala Lys Asp lie lie Glu Thr Thr Ser Gly Thr He Asp Tyr 

225 230 235 240 

gta gag ctt tat tec tat ccg gaa etc gag cct gtg aat gaa att get 7 68 
Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu He Ala 

245 250 255 

gga aag atg att etc get gtt gca gtt get ttt tea aaa gcg cgt tta 816 

Gly Lys Met lie Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu 

260 265 270 

ata gat aat ate att att gat att cga gaa atg gag aga ata 858 

He Asp Asn lie lie lie Asp lie Arg Glu Met Glu Arg lie 

275 280 285 



<210> 26 
<211> 286 
<212> PRT 

<213> Bacillus subtilis 
<400> 26 

Met Arg Gin lie Thr Asp lie Ser Gin Leu Lys Glu Ala lie Lys Gin 
1 5 10 15 
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Tyr His Ser Glu Gly Lys Ser He Gly Phe Val Pro Thr Met Gly Phe 
20 25 30 

Leu His Glu Gly His Leu Thr Leu Ma Asp Lys Ala Arg Gin Glu Asn 
35 40 45 

Asp Ala Val He Met Ser He Phe Val Asn Pro Ala Gin Phe Gly Pro 

50 ,55 €0 , 

Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp He Glu Arg Asp Ala Ala 
65 '70 75 80 

Leu Ala Glu Asn Ala Gly Val Asp He Leu Phe Thr Pro Asp Ala -His 
85 90 95 

Asp Met Tyr Pro Gly Glu Lys Asn Val Thr He His Val, Glu Arg Arg 
100 105 HO 

Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val 
115 ' 120 125 

Ala He Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala 
130 135 140 

Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val Ala Val Val Asp Gly Leu 
145 150 155 160 

He Ser Asp Phe Phe Met Asp He -Glu Leu Val Pro Val Asp Thr Val 
165 170 175 

Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr 
180 185 190 

Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gin Thr 
195 200 205 

Ser Ala Glu Leu Val Gin Ala Gly Glu Arg Asp Pro Glu Ala Val He 
210 215 220 

Lys Ala Ala Lys Asp He He Glu Thr Thr Ser Gly Thr He Asp Tyr 
225 230 235 240 

Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu He Ala 
245 250 255 

Gly Lys Met He Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu 
260 265 270 

He Asp Asn He He He Asp He Arg Glu Met Glu Arg lie 
275 280 285 



<210> 27 
<211> 381 
<212> DNA 

<213> Bacillus subtilis 
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<220> 
<221> CDS 
<222> (1) . . (381) 

<400> 27 

, atg tat cga aca atg atg age ggc aaa ctt cac agg gca act gtt acg 48 

Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr 
15 10 15 

gaa gca aac ctg aac tat gtg gga age att aca att gat gaa gat etc 96 , 
Glu Ala Asn Leu Asn Tyr Val Gly Ser He Thr He Asp Glu Asp Leu 
20 25 30 

att gat get gtg gga atg ctt cct aat gaa aaa gta caa att gtg aat 144 
He Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gin He Val Asn 
35 40 45 

aat aat aat gga gca cgt ctt gaa acg tat att att cct ggt aaa egg 192 
Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr He He Pro Gly Lys Arq 
50 55 " 60 

gga age ggc gtc ata tgc tta aac ggt gca gee gca ego ctt gtg cag 240 
. Gly Ser Gly Val He Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gin 
65 70 75 80 

gaa gga gat aag gtc att att att tec tac aaa atg atg tct gat caa 288 
Glu Gly Asp Lys Val He He lie Ser Tyr Lys Met Met Ser Asp Gin 
85 90 95 

gaa gcg gca age cat gag ccg aaa gtg get gtt ctg aat gat caa aac 336 
Glu Ala Ala Ser His Glu Pro Lys Val Ala Val Leu Asn Asp Gin Asn 
100 105 HO 

aaa att gaa caa atg ctg ggg aac gaa cca gec cgt aca att ttg 381 
Lys He Glu Gin Met Leu Gly Asn Glu Pro Ala Arg Thr He Leu 
115 120 125 

<210> 28 
<211> 127 
<212> PRT 

<213> Bacillus subtilis 
<400> 28 

Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr 
15 10 J 15 

Glu Ala Asn Leu Asn Tyr Val Gly Ser He Thr He Asp Glu Asp Leu 
20 25 30 

He Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gin He Val Asn 
35 40 45 

Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr He He Pro Gly Lys Arg 
50 55 60 



Gly Ser Gly Val He Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gin 
65 70 75 80 
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Glu Gly Asp Lys Val He lie lie Ser Tyr Lys -Met Met Ser Asp <Gln 
85 90 95 

Glu Ala Ala Ser His Glu Pro Lys Val Ala Val Leu Asn Asp -Gin Asn 
100 , 105 HO 

Lys He Glu Gin Met Leu Gly Asn Glu Pro Ala Arg Thr He Leu 
115 120 125 



<210> 29 
<211> 894 
<212> DNA 

<213> Bacillus subtilis 

<220> i 

<221> CDS 

<222> (1) . . (894) 

<400> 29 

atg aaa att gga att ate ggc gga ggc tec gtt ggt ctt tta tgc gee 

Met Lys He Gly He He Gly Gly Gly Ser Val Gly Leu Leu Cys Ala 
! 5 10 15 

tat tat ttg tea ctt tat cac gac gtg act gtt gtg aeg agg egg caa 
Tvr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg Gin 
20 25 30 



48 



96 



gaa cag get gcg gee att cag tct gaa gga ate .egg ctt tat aaa ggc 144 
Glu Gin ' Ala Ala Ala He -Gin Ser Glu Gly He Arg Leu Tyr Lys Gly 
35 40 45 

ggg gag gaa ttc agg get gat tgc agt gcg gac acg agt ate aat teg 192 
Giy Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser He Asn Ser 
50 55 60 

gac ttt gac ctg ctt gtc gtg aca gtg aag cag cat -cag ctt -caa tct 240 
Asp Phe Asp Leu Leu Val Val Thr Val Lys -Gin His Gin Leu Gin Ser 
65 70 75 80 

gtt ttt teg teg ctt gaa cga ate ggg aag acg aat ata tta ttt ttg 
Val Phe Ser Ser Leu Glu Arg He Gly Lys Thr Asn He Leu Phe Leu 
85 90 95 

caa aac ggc atg ggg cat ate cac gac eta aaa gac tgg eac gtt ggc 
Gin Asn Gly Met Gly His He His Asp -Leu Lys Asp Trp His Val -Gly 
100 ios, no 

cat tec att tat gtt gga ate gtt gag cac gga get gta aga aaa teg 
His Ser He Tyr Val Gly He Val <31u His Gly Ala Val Arg Lys Ser 
115 120 125 

gat aca get gtt gat cat aca ggc eta ggt gcg ata aaa tgg age gcg 4 32 
Asp Thr Ala Val Asp His Thr Gly Leu Gly Ala He Lys Trp Ser Ala 
130 135 140 



28B 



336 



384 



ttc gac gat get gaa cca gac egg ctg aac ate ttg ttt eag eat aac 



480 
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Phe Asp Asp Ala Glu Pro Asp Arg Leu Asn lie Leu Phe Gin His Asn 
145 150 155 , 160 

cat teg gat ttt ccg att tat tat gag acg gat tgg tac cgt ctg ctg 528 
His Ser Asp Phe Pro He Tyr Tyr Glu Thr Asp Trp Tyr Arg Leu Leu 
165 170 ~ 175 

acg ggc aag ctg att gta aat gcg tgt att aat cct tta act gcg tta 576 
Thr Gly Lys Leu He Val Asn Ala Cys He Asn Pro Leu Thr Ala Leu 
180 185 190 

ttg caa gtg aaa aat gga gaa ctg ctg aca acg cca get tat ctg get 624 
Leu Gin Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala 
195 200 205 

ttt atg aag ctg gta ttt cag gag gca tgc cgc att tta aaa ctt gaa 672 
Phe Met Lys Leu Val Phe Gin Glu Ala Cys Arg He Leu Lys Leu Glu 
210 215 220 

aat gaa gaa aag get tgg gag egg gtt cag gec gtt tgt ggg caa acg 720 
Asn Glu Glu Lys Ala Trp Glu Arg Val Gin Ala Val Cys Gly Gin Thr 
225 230 235 ' 240 

aaa gag aat cgt tea tea atg ctg gtt gac gtc att gga ggc egg cag 768 
Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val He Gly Gly Arg Gin 
245 250 "* 255 

acg gaa get gac gee att ate gga tac tta ttg aag gaa gca agt ctt 816 
Thr Glu Ala Asp Ala He He Gly Tyr Leu Leu Lys Glu Ala Ser Leu 
260 265 270 

caa ggt ctt gat gee gtc cac eta gag ttt tta tat ggc age ate aaa 864 
Gin Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr Gly Ser He Lys 
275 280 285 

gca ttg gag cga aat aca aac aaa gtc ttt 894 
Ala Leu Glu Arg Asn Thr Asn Lys Val Phe 
290 295 



<210> 30 
<211> 298 
<212> PRT 

<213> Bacillus subtilis 
<400> 30 

Met Lys He Gly He He Gly Gly Gly Ser Val Gly Leu Leu Cys Ala 
1 5 10 15 

Tyr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg Gin 
20 25 30 

Glu Gin Ala Ala Ala He Gin Ser Glu Gly He Arg Leu Tyr Lys Gly 
35 40 45 

Gly Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser He Asn Ser 
50 55 60 
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Asp Phe Asp Leu Leu Val Val Thr Val Lys -Gin His Gin Leu Gin Ser 
€5 70 75 80 

Val Phe Ser Ser Leu XSlu Arg lie ^Gly Lys Thr Asn lie Leu Phe Leu 
85 90 95 

Gin Asn Gly Met Gly His He His Asp Leu Lys Asp Trp His Val Gly 
100 105 HO 

His Ser He Tyr Val Gly lie Val <31u His Gly Ala Val Arg Lys Ser 
115 ■ 120 125 

Asp Thr Ala Val Asp His Thr Gly Leu Gly Ala He Lys Trp Ser Ala 
130 135 140 

Phe Asp Asp Ala Glu Pro Asp Arg Leu Asn He Leu Phe Gin His Asn 
145 150 155 , 160 

His Ser Asp Phe Pro He Tyr Tyr -Glu Thr Asp Trp Tyr Arg Leu Leu 
W5 170 175 

Thr Gly Lys Leu He Val Asn Ala Cys He Asn Pro Leu Thr Ala Leu 
180 185 190 

Leu Gin Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala 
195 ' ' 200 205 

Phe Met Lys Leu Val Phe Gin 'Glu Ala Cys Arg He Leu Lys Leu Glu 
210 215 220 

Asn Glu Glu Lys Ala Trp Glu Arg Val Gin Ala Val Cys Gly -Gin Thr 
225 230 235 240 

Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val He Gly Gly Arg Gin 
245 250 255 

Thr Glu Ala Asp Ala He He -Gly Tyr Leu Leu Lys -Glu Ala -Ser Leu 
260 2*5 270 

Gin Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr <Gly Ser He Lys 
275 280 285 

Ala Leu Glu Arg Asn Thr Asn Lys Val Phe 
290 295 



<210> 31 
<211> 1725 
<212> DNA 

<213> Bacillus subtilis 



<220> 

<221> CDS 

<222> (1) ..{1722) 



<400> 31 

atg ggg act eat gta cag gtg gat tea gca tct gec gaa tgt aca cag 
Met Gly Thr Asn Val Gin Val Asp Ser Ala Ser Ala -Glu Cys Thr Gin 
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15 10 15 

acg atg age gga gca tta atg ctg att gaa tea tta aaa aaa gag aaa 96 

Thr Met Ser Gly Ala Leu Met Leu He Glu Ser Leu Lys Lys Glu Lys 

20 25 30 

gta gaa atg ate ttc ggt tat ccg ggc ggg get gtg ctt ccg att tac 144 , 

Val Glu Met He Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro He Tyr 

35 40 45 

gat aag eta tac aat tea ggg ttg gta cat ate ctt ccc cgt cac gaa 192 

Asp Lys Leu Tyr Asn Ser Gly Leu Val His He Leu Pro Arg His Glu 

50 55 60 

caa gga gca att cat gca gcg gag gga tac gca agg gtc tec gga aaa 240 

Gin Gly Ala He His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys 

65 70 75 80 

ccg ggt gtc gtc att gee acg tea ggg ccg gga gcg aea aac ctt gtt 288 

Pro Gly Val Val He Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 

85 90 95 

i 

aca ggc ctt get gat gee atg att gat tea ttg ccg tta gtc gtc ttt 336 

Thr Gly Leu Ala Asp Ala Met He Asp Ser Leu Pro Leu Val Val Phe 

100 105 no 

aca ggg cag gta gca ace tct gta ate ggg age gat gca ttt cag gaa 384 

Thr Gly Gin Val Ala Thr Ser Val He Gly Ser Asp Ala Phe Gin Glu 

115 120 125 

gca gac att tta ggg att acg atg cca gta aca aaa cac age tac cag 432 

Ala Asp He Leu Gly He Thr Met Pro Val Thr Lys His Ser Tyr Gin 

130 135 140 

gtt cgc cag ccg gaa gat ctg ccg cgc ate att aaa gaa gcg ttc cat 480 

Val Arg Gin Pro Glu Asp Leu Pro Arg He He Lys Glu Ala Phe His 

145 150 155 160 

att gca aca act gga aga ccc gga cct gta ttg att gat att ccg aaa 528 

He Ala Thr Thr Gly Arg Pro Gly Pro Val Leu He Asp He Pro Lys 

165 170 175 

gat gta gca aca att gaa gga gaa ttc age tac gat cat gag atg aat 576 

Asp Val Ala Thr He Glu Gly Glu Phe Ser Tyr Asp His Glu Met Asn 

180 185 190 

etc ccg gga tac cag ccg aca aca gag ccg aat tat ttg cag ate cgc 624 

Leu Pro Gly Tyr Gin Pro Thr Thr Glu Pro Asn Tyr Leu Gin He Arg 

195 200 205 

aag ctt gtg gaa gee gtg age agt gcg aaa aaa ccg gtg ate ctg gcg 672 

Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val He Leu Ala 

210 215 ~ 220 



ggt gcg ggc gta ctg cac gga aaa gcg tea gaa gaa tta aaa aat tat 
Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr 
225 230 235 240 



720 
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get gaa cag cag caa ate ! cct gtg ^ca cac acc ctt ttq ggg etc gga 7-68 
Ala Glu Gin Gin Gin lie Pro Val Ala His Thr Leu Leu Gly Leu "Gly 
245 250 255 

ggc ttc ccg get gac cat ccg ctt ttc eta ggg atg gcg gga atg cac 816 
Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Met Ala Gly Met His 
260 265 270 

ggt act tat aca gec aat atg gee ctt cat gaa tgt gat eta tta ate 864 
Gly Thr Tyr Thr Ala Asn Met Ala Leu His -Glu Cys Asp Leu Leu He 
275 280 285 

agt ate ggc gee cgt ttt gat gac cgt gtc aca gga aac ctg aaa cac 912 
Ser He Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His 
290 295 300 

ttt gee aga aac gca aag ata gec cac ate gat att gat, cca get gaa 960 
Phe Ala Arg Asn Ala Lys He Ala His He Asp He Asp Pro Ala Glu 
305 310 315 320 

ate gga aaa ate atg aaa aca cag att cct gta gtc gga gac age aaa 1008 
He Gly Lys He Met Lys Thr Gin He Pro Val Val Gly Asp Ser Lys 
325 330 335 

att gtc ctg cag gag ctg ate aaa caa gac ggc aaa *caa age gat tea 1056 
He Val Leu Gin Glu Leu lie Lys Gin Asp Gly Lys Gin Ser Asp Ser 
340 345 350 

age gaa tgg aaa aaa cag etc gca gaa tgg aaa gaa gag tat ccg etc 1104 
Ser Glu Trp Lys Lys Gin Leu Ala Glu Trp Lys Glu -Glu Tyr -Pro Leu 
355 360 365 

tgg tat gta gat aat gaa gaa gaa ggt ttt aaa cct cag aaa ttg att 1152 
Trp Tyr Val Asp Asn Glu Glu Glu Gly Phe Lys Pro Gin Lys Leu He 
370 375 380 



gaa tat att cat caa ttt aca aaa gga gag gee att gtc gca acg ^gat 
Glu Tyr He His Gin Phe Thr Lys Gly Glu Ala He Val Ala Thr Asp 
385 390 395 400 

gta ggc cag cat caa atg tgg tea gcg caa ttt tat ccg ttc caa aaa 
Val Gly Gin His Gin Met Trp *Ser Ala Gin Phe Tyr Pro Phe Gin Lys 
405 410 415 



1200 



1248 



gca gat aaa tgg gtc acg tea ggc gga ctt gga acg atg gga ttc ggt 1296 
Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Wet Gly Phe Gly 
420 425 430 

ctt ccg gcg gcg ate ggc gca cag ctg gee gaa aaa gat ^ct act gtt 134 4 
Leu Pro Ala Ala He Gly Ala Gin Leu Ala Glu Lys Asp Ala Thr Val 
435 440 445 



gtc gcg gtt gtc gga gac ggc gga ttc <:aa atg acg ctt caa gaa etc 
Val Ala Val Val Gly Asp Gly Gly Phe Gin Met Thr Leu Gin Glu Leu 
450 455 460 

gat gtt att cgc gaa tta aat ctt ccg gtc aag gta gtg att t*a aat 
Asp Val He Arg Glu Leu Asn Leu Pro Val Lys Val Val He Leu Asn 



1392 



1440 
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465 470 475 480 

aac get tgt etc gga atg gtc aga cag tgg cag gaa att ttc tat gaa 1488 
Asn Ala Cys Leu Gly Met Val Arg Gin Trp Gin Glu He Phe Tyr Glu 
485 490 495 

gaa cgt tat tea gaa tct aaa ttc get tct cag cct gac ttc gtc aaa 1536 ' 
Glu Arg Tyr Ser Glu Ser Lys Phe Ala Ser Gin Pro Asp Phe Val Lys 
500 505 510 

ttg tec gaa gca tac ggc att aaa ggc ate aga att tea tea gaa gcg 1584 
Leu Ser Glu Ala Tyr Gly He Lys Gly He Arg He Ser Ser Glu Ala 
515 520 525 

gaa gca aag gaa aag ctg gaa gag gca tta aca tea aga gaa cct gtt 1632 
Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val 
530 535 540 

gtc att gac gtg egg gtt gee age gaa gaa aaa gta ttc ccg atg gtg 1680 
Val He Asp Val Arg Val Ala Ser Glu Glu Lys Val Phe Pro Met Val 

545 5 50 555 560 

i 

get ccg ggg aaa ggg ctg cat gaa atg gtg ggg gtg aaa cct tga 1725 
Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro 
565 570 



<210> 32 
<211> 574 
<212> PRT 

<213> Bacillus subtilis 
<400> 32 

Met Gly Thr Asn Val Gin Val Asp Ser Ala Ser Ala Glu Cys Thr Gin 
1 5 io 15 

Thr Met Ser Gly Ala Leu Met Leu He Glu Ser Leu Lys Lys Glu Lvs ' 
20 25 30 

Val Glu Met He Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro He Tyr 
35 40 45 

Asp Lys Leu Tyr Asn Ser Gly Leu Val His He Leu Pro Arg His Glu 
50 55 60 

Gin Gly Ala He His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys 
65 70 75 ~ 80 

Pro Gly Val Val He Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 
85 90 95 

Thr Gly Leu Ala Asp Ala Met He Asp Ser Leu Pro Leu Val Val Phe 
100 105 no 

Thr Gly Gin Val Ala Thr Ser Val He Gly Ser Asp Ala Phe Gin Glu 
115 120 125 

Ala Asp He Leu Gly He Thr Met Pro Val Thr Lys His Ser Tyr Gin 
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130 



135 



140* 



Val Arg Gin Pro Glu Asp Leu -Pro Arg lie lie Lys Glu Ala Phe His 
145 " 150 155 160 

♦ 

lie Ala Thr Thr Gly Arg Pro Gly Pro Val Leu lie Asp He Pro Lys 
165 ^ 170 ,175 

Asp Val Ala Thr He Glu -Gly Glu Phe Ser Tyr Asp His -Glu Met Asn 
180 165 190 

Leu Pro Gly Tyr Gin Pro Thr Thr Glu Pro Asn Tyr Leu "Gin He Arg 
195 200 205 

Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val He Leu Ala 
210 215 220 

Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr 
225 " 230 235 240 

Ala Glu Gin Gin Gin He Pro Val Ala His Thr Leu Leu -Gly Leu Gly 
245 250 255 

Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Wet Ala Gly Met His 
260 265 270 

Gly Thr Tyr Thr Ala Asn Met Ala Leu His -Glu Cys Asp Leu Leu He 
275 280 285 

Ser He Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His 
290 " " 295 300 

Phe Ala Arg Asn Ala Lys lie Ala His He Asp lie Asp -Pro Ala -Glu 
305 310 315 320 

He Gly Lys He Met Lys Thr Gin He Pro Val Val <Gly Asp Ser Lys 
325 330 335 

He Val Leu Gin Glu Leu lie Lys -Gin Asp ^Gly Lys -Gin Ser Asp Ser 
340 345 350 

Ser Glu Trp Lys Lys Gin Leu Ala -Glu Trp Lys -Glu Glu Tyr Pro Leu 
355 360 365 

Trp Tyr Val Asp Asn Glu Glu Glu <31y Phe Lys Pro Gin Lys Leu lie 
370 9 375 380 

•Glu Tyr He His Gin Phe Thr Lys Gly -Glu Ala He Val Ala Thr Asp 
385 390 395 400 

Val Gly Gin His Gin Met Trp Ser Ala Gin Phe Tyr Pro Phe Gin Lys 
405 410 415 



Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Met Gly Phe Gly 
420 425 430 



Leu Pro Ala Ala lie Gly Ala Gin Leu Ala Glu Lys Asp Ala Thr Val 
435 440 445 
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Val Ala Val Val Gly Asp Gly Gly Phe Gin Met Thr Leu Gin Glu Leu 
450 1 455 460 

Asp Val He Arg Glu Leu Asn Leu Pro Val Lys Val Val He Leu Asn 
465 470 475 480 

Asn Ala Cys Leu Gly Met Val Arg Gin Trp Gin Glu He Phe Tyr Glu 
485 490 495 

Glu Arg Tyr Ser Glu Ser Lys Phe Ala Ser Gin Pro Asp Phe Val Lys 
500 505 5i0 

Leu Ser Glu Ala Tyr Gly He Lys Gly He Arg He Ser Ser Glu Ala 
515 520 ~ 525 

Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val 
530 535 540 

Val He Asp Val Arg Val Ala Ser Glu Glu Lys Val Phe Pro Met Val 
545 550 555 560 

Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro 
565 570 



<210> 33 
<211> 525 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (522) 

<400> 33 

ttg aaa aga att ate aca ttg act gtg gtg aac cgc tec ggg gtg tta 

Met Lys Arg He He Thr Leu Thr Val Val Asn Arg Ser Gly Val Leu 
15 10 15 

aac egg ate ace ggt eta ttc aca aaa agg cat tac aac att gaa age 
Asn Arg He Thr Gly Leu Phe Thr Lys Arg His Tyr Asn He Glu Ser 
20 25 30 

att aca gtt gga cac aca gaa aca gec ggc gtt tec aga ate ace ttc 
He Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg He Thr Phe 
35 40 45 

gtc gtt cat gtt gaa ggt gaa aat gat gtt gaa cag tta acg aaa cag 
Val Val His Val Glu Gly Glu Asn Asp Val Glu Gin Leu Thr Lys Gin 
50 55 60 

etc aac aaa cag att gat gtg ctg aaa gtc aca gac ate aca aat caa 
Leu Asn Lys Gin He Asp Val Leu Lys Val Thr Asp He Thr Asn Gin 
65 70 75 80 



teg att gtc cag agg gag ctg gee tta ate aag gtt gtc tec gca cct 
Ser He Val Gin Arg Glu Leu Ala Leu He Lys Val Val Ser Ala Pro 
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85 90 95 

tea aca aga aca gag att aat gga ate ata gaa ccg ttt aga gec tct 
Ser Thr Arg Thr Glu He Asn Gly He He Glu Pro Phe Arg Ala ~Ser 
100 105 HO 

gtc gtt gat gtc age aga gac age ate gtt gtt -cag ^ftg aca ggt gaa 
Val Val Asp Val Ser Arg Asp Ser He Val Val Gin Val Thr Gly Glu 
115 120 125 

tct aac aaa att gaa gcg -ctt att gag tta tta aaa cct tat ggc att 
Ser Asn Lys He Glu Ala Leu He Glu Leu Leu Lys Pro Tyr "Gly He 
130 135 140 

aaa gaa ate gcg aga aca ggt aca aeg get ttt gcg agg gga ace age 
Lys Glu He Ala Arg Thr Gly Thr Thr Ala Phe Ala Arg <31y Thr Ser 
145 150 155 160 

aaa agg cgt cat cca ata aaa caa tat eta ttg tat aaa aca taa 
Lys Arg Arg His Pro He Lys Gin Tyr Leu Leu Tyr Lys Thr 
165 170 



<210> 34 
.<211> 174 

<212> PRT k ' ' * 

<213> Bacillus subtilis 

<400> 34 

Met Lys Arg He He Thr Leu Thr Val Val Asn Arg Ser Gly Val Leu 
15 10 15 

Asn Arg He Thr Gly Leu Phe Thr Lys Arg His Tyr Asn He Glu Ser 
20 25 30 

He Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg He Thr Phe 
35 40 45 

Val Val His Val Glu Gly Glu Asn Asp Val Glu -Gin Leu Thr Lys Gin 
50 55 60 

Leu Asn Lys Gin He Asp Val Leu Lys Val Thr Asp He Thr Asn Gin 
65 70 75 80 

Ser He Val Gin Arg Glu Leu Ala Leu He Lys Val Val Ser Ala Pro 
85 90 95 

Ser Thr Arg Thr Glu He Asn -Gly He lie Glu Pro Phe Arg Ala Ser 
100 105 HO 

Val Val Asp Val Ser Arg Asp Ser He Val Val Gin Val Thr Gly <Hu 
115 "* 120 125 

Ser Asn Lys He Glu Ala Leu He Glu Leu Leu Lys Pro Tyr *Gly He 
130 135 140 

Lys Glu He Ala Arg Thr Gly Thr Thr Ala Phe Ala Arg <31y Thr Ser 
145 150 155 160 



336 



384 



432 



4B0 



525 
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Lys Arg Arg His Pro He Lys Gin Tyr Leu Leu Tyr Lys Thr 
165. ■ 170 



<210> 35 
<211> 1029 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (1026) 

<400> 35 

atg gta aaa gta tat tat aac ggt gat ate aaa gag aac gta ttg get 48 
Met Val Lys Val Tyr Tyr Asn Gly Asp He Lys Glu Asn Val Leu Ala 
15 10 15 

gga aaa aca gta gcg gtt ate ggg tac ggt teg caa ggc cac gca cat 96 
Gly Lys Thr Val Ala Val He Gly Tyr Gly Ser Gin Gly His Ala His 
20 25 30 

gec ctg aac ctt aaa gaa age gga gta gac gtg ate gtc ggt gtt aga 14 4 
Ala Leu Asn Leu Lys Glu Ser Gly Val Asp Val He Val Gly Val Arg 
35 40 45 

caa gga aaa tct ttc act: caa gec caa gaa gac gga cat aaa gta ttt 192 
Gin Gly Lys Ser Phe Thr Gin Ala Gin Glu Asp Gly His Lys Val Phe 
50 55 60 

tea gta aaa gaa gcg gca gec caa gec gaa ate ate atg gtt ctg ctt 240 
Ser Val Lys Glu Ala Ala Ala Gin Ala Glu He He Met Val Leu Leu 
65 70 75 80 

ccg gat gag cag cag caa aaa gta tac gaa get gaa ate aaa gat gaa 288 
Pro Asp Glu Gin Gin Gin Lys Val Tyr Glu Ala Glu He Lys Asp Glu 
85 90 95 



ttg aca gca gga aaa tea tta gta ttc get cat gga ttt aac gtg cat 
Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His 
100 105 " HO 



336 



ttc cat caa att gtt cct ccg gcg gat gta gat gta ttc tta gtg gec 384 
Phe His <Sln He Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala 
115 120 125 

cct aaa ggc ccg gga cac ttg gta aga aga aca tat gag caa gga get 4 32 

Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr. Tyr Glu Gin Gly Ala 
130 135 140 

ggc gta cct gca ttg ttc gca ate tat caa gat gtg act gga gaa gca 4 80 

Gly Val Pro Ala Leu Phe Ala He Tyr Gin Asp Val Thr Gly Glu Ala 
145 150 155 * 160 

aga gac aaa gee etc get tat get aaa gga ate ggc ggc gca aga gcg 528 

Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly He Gly Gly Ala Arg Ala 

165 170 * 175 



WO 01/021772 



PCT/US0O/25993 



42 



576 



624 



672 



720 



age gta tta gaa acg aca ttt aaa gaa gaa aca gaa aca gat ttg ttc 
Gly Val Leu Glu Thr Thr Phe Lys -Glu Glu Thr <51u Thr Asp Leu Phe 
180 185 190 

ggt gag caa gca gtt ctt tgc ggc gga tta age gcg ctt gtc aaa gec 
Gly Glu Gin Ala Val Leu Cys <31y «31y Leu Ser Ala Leu Val iys Ala 
195 200 205 

gga ttt gaa acc tta act gaa gca ggt tat cag cct -gaa -ctt gca tac 
Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Gin Pro Glu Leu Ala Tyr 
210 215 220 

ttc gag tgt ctt cat gag ctg aaa tta ate gta gac ctt atg tac gaa 
Phe Glu Cys Leu His Glu Leu Lys Leu He Val Asp Leu Met Tyr Glu 
225 230 235 240 

gaa gga ctt gca gga atg aga tat tea ate *ct gac aca gca cag tgg 768 
Glu Gly Leu Ala Gly Met Arg Tyr Ser He Ser Asp Thr Ala Gin Trp 
245 250 1 255 

gga gat ttc gta tea ggc cct cgc gtt gtg gac gee aaa gta aaa gaa 
Glv Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys <31u 
260 265 270 

tct atg aaa gaa gta tta aaa gat: 'ate caa aac ggt aca tte gca aaa' 
Ser Met Lys Glu Val Leu Lys Asp He Gin Asn Gly Thr Phe Ala Lys 



816 



864 



275 



280 285 



gag tgg ate gtc gaa aac .caa gta aac cgt cct egt ttc aac get ate 
Glu Trp He Val Glu Asn Gin Val Asn Arg Pro Arg Phe Asn Ala He 
290 295 300 

aat gca age gag aac gaa cat caa ate gaa gta gtg gga aga aag ctt 
Asn Ala Ser Glu Asn Glu His Gin He Glu Val Val Gly Arg Lys Leu 
305 310 315 320 

cgt gaa atg atg ccg ttt gtg aaa caa ggc aag aag aag gaa gcg gtg 
Ara Glu Met Met Pro Phe Val Lys Gin Gly Lys Lys Lys <31u Ala Val 
325 330 335 

gtc tec gtt gcg caa aat taa 
Val Ser Val Ala Gin Asn 
340 



<210> 36 
<211> 342 
<212> PRT 

<213> Bacillus subtilis 
<400> 36 

Met Val Lys Val Tyr Tyr Asn Gly Asp He Lys <51u Asn Val Leu Ala 
15 10 15 

Glv Lys Thr Val Ala Val He Gly Tyr Gly Ser Gin "Gly His Ala His 
20 25 30 



912 



960 



1008 



1029 
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Ala 



Leu Asn Leu Lys Glu Ser 



Gly Val 
40 



Asp Val 



He Val Gly 
45 



Val 



Arg • 



35 



Gin Gly Lys Ser Phe Thr Gin Ala Gin Glu Asp Gly His Lys Val Phe 
50 , 55 60 

Ser Val Lys Glu Ala Ala Ala Gin Ala Glu lie He Met Val Leu Leu 
65 70 75 80 

Pro Asp Glu Gin Gin Gin Lys Val Tyr Glu Ala Glu He Lys Asp Glu 
85 90 " 95 

Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His 
100 105 no 

Phe His Gin He Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala 
115 120 125 

Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr Tyr G^u Gin Gly Ala 
130 135 140 

Gly Val Pro Ala Leu Phe Ala He Tyr Gin Asp Val Thr Gly Glu Ala 
145 150 155 160 

Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly He Gly Gly Ala Arg Ala 
165 170 175 

Gly Val Leu Glu Thr Thr Phe Lys Glu Glu Thr Glu Thr Asp Leu Phe 
180 185 190 

Gly Glu Gin Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala 
195 200 205 

Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Gin Pro Glu Leu Ala Tyr 
210 215 220 

Phe Glu Cys Leu His Glu Leu Lys Leu He Val Asp Leu Met Tyr Glu 
225 230 235 240 

Glu Gly Leu Ala Gly Met Arg Tyr Ser He Ser Asp Thr Ala Gin Trp 
245 250 255 

Gly Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu 
260 265 270 

Ser Met Lys Glu Val Leu Lys Asp He Gin Asn Gly Thr Phe Ala Lys 
275 280 285 

Glu Trp He Val Glu Asn Gin Val Asn Arg Pro Arg Phe Asn Ala He 
290 295 " 300 

Asn Ala Ser Glu Asn Glu His Gin He Glu Val Val Gly Arg Lys Leu 
305 310 315 320 

Arg Glu Met Met Pro Phe Val Lys Gin Gly Lys Lys Lys Glu Ala Val 
325 330 335 

Val Ser Val Ala Gin Asn 
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340 



<210> 37 
<211> 1674 
<212> ONA 

<213> Bacillus subtilis , 

<220> 
<221> CDS 
<222> (1)..(1674) 

<400> 37 

atg gca gaa tta cgc agt aat atg ate aca caa -gga ate ^gat aga get 
Met Ala Glu Leu Arg Ser Asn Met lie Thr -Gin -Gly lie Asp Arg Ala 
1.5 10 15 

ccg cac cgc agt ttg ctt cgt gca gca ggg gta aaa gaa gag gat ttc 96 
Pro His Arg Ser Leu Leu Arg Ala Ala <51y Val Lys Glu Glu Asp *he 
20 25 .30 



48 



ggc aag ccg ttt att gcg gtg tgt aat tea tac att -^at ate gtt cec 
Gly Lys Pro Phe He Ala Val Cys Asn Ser Tyr He Asp He Val Pro 
35 40 45 

ggt cat gtt cac ttg eag gag ttt -ggg aaa ate gta aaa gaa gca ate 
Gly His Val His Leu Gin -Glu Phe Gly Lys He Val Lys Glu Ala He 
50 55 60 

aga gaa gca ggg ggc gtt ccg ttt gaa ttt aat ace att ggg gta gat 
Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr He tSly Val Asp 
65 70 75 80 

gat ggc ar.c gca atg ggg cat ate ggt atg aga tat ^:cg etg cca age 
Asp Gly He Ala Met Gly His He Gly Met Arg Tyr Ser Leu Pro Ser 
85 90 95 

cgt gaa att ate gca gae tct gtg gaa acg gtt gta tec gca cac tgg 
Arg Glu He He Ala Asp -Ser Val <31u Thr Val Val Ser Ala iiis Trp 
100 105 HO 

ttt gae gga atg gte tgt att ccg aae tgc gae aaa ate aca -ccg gga 
Phe Asp <51y Met Val Cys He Pro Asn Cys Asp Lys lie Thr Pro *Gly 
115 120 125 



ggc gga ccg atg gcg gca gga aga aca agt tac ggg cga aaa ate tec 
Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr <31y Arg Lys He Ser 
145 150 155 160 

ctt tec tea gta ttc gaa ggg gta ggc gec tac caa ^gca ggg aaa ate 
Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr <Sln Ala <31y Lys He 
165 170 175 



144 



192 



240 



288 



336 



384 



atg ctt atg gcg gca atg cgc ate aae att ccg acg att ttt gtc age 432 
Met Leu Met Ala Ala Met Arg lie Asn He Pro Thr He Phe Val Ser 
130 135 140 



480 



S28 



aae gaa aae gag ctt caa gaa eta gag cag ttc ^ga tgc cca acg tgc 576 
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Asn Glu Asn Glu Leu Gin Glu Leu Glu Gin Phe Gly Cys Pro Thr Cys ' 

180 185 190 

i 

ggg tct tgc tea ggc atg ttt acg gcg aac tea atg aac tgt* ctg tea 624 
Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Ser 
195 200 205 

gaa gca ctt ggt ctt get ttg ecg ggt aat gga ace att ctg gca aca 672 
Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr He' Leu Ala Thr 
210 215 220 

tct ccg gaa cgc aaa gag ttt gtg aga aaa teg get gcg caa tta atg 720 
Ser Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gin Leu Met 
225 230 235 240 

gaa acg att cgc aaa gat ate aaa ccg cgt gat att gtt aca gta aaa 768 
Glu Thr lie Arg Lys Asp He Lys Pro Arg Asp He Val Thr Val Lys 
245 250 255 

gcg att gat aac gcg ttt gca etc gat atg gcg etc gga ggt tct aca 816 
Ala He Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr 
260 265 270 

aat ace gtt ctt cat aec ctt gee ctt gca aac gaa gee ggc gtt gaa 864 
Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu 
275 280 285 

tac tct tta gaa cgc att' aac gaa gtc get gag cgc gtg ccg cac ttg 91? ' 
Tyr Ser Leu Glu Arg He Asn Glu Val Ala Glu Arg Val Pro His Leu 
290 295 300 

get aag ctg gcg cct gca teg gat gtg ttt att gaa gat ctt cac gaa 960 
Ala Lys Leu Ala Pro Ala Ser Asp Val Phe lie Glu Asp Leu His Glu 
305 310 315 " 320 

gcg ggc ggc gtt tea gcg get ctg aat gag ctt teg aag aaa gaa gga 1008 
Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly 
325 330 " 335 

gcg ctt cat tta gat gcg ctg act gtt aca gga aaa act ctt gga gaa 1056 
Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu 
340 345 350 

aec att gee gga cat gaa gta aag gat tat gac gtc att cac ccg ctg 1104 
Thr He Ala Gly His Glu Val Lys Asp Tyr Asp Val He His Pro Leu 
355 360 365 

gat caa cca ttc act gaa aag gga ggc ctt get gtt tta ttc ggt aat 1152 
Asp Gin Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn 
370 375 380 

eta get ccg gac ggc get ate att aaa aca ggc ggc gta cag aat ggg 1200 
Leu Ala Pro Asp Gly Ala He He Lys Thr Gly Gly Val Gin Asn Gly 
385 390 395 400 

att aca aga cac gaa ggg ccg get gtc gta ttc gat tct cag gac gag 1248 
He Thr Arg His Glu Gly Pro Ala Val Val Phe Asp Ser Gin Asp Glu 
405 410 415 
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gcg ctt gac ggc att ate aac -cga aaa -gta aaa gaa ggc gac gtt gtc 1296 
Ala Leu Asp Gly He He Asn Arg Lys Val Lys <31u Gly Asp Val Val 
420 425 430 

i 

ate ate aga tac gaa ggg cca aaa ggc gga cct ^gc atg ccg gaa atg 134 4 
He He Arg Tyr Glu Gly Pro Lys <51y t31y Pro =Gly Met Pro,«Glu Met 
435 440 445 



ctg gcg cca aca tec caa ate gtt gga atg gga etc ggg cca aaa -gtg 
Leu Ala Pro Thr Ser Gin lie Val Gly Met €ly Leu X51y Pro Lys Val 
450 455 460 • 

gca ttg att acg gac gga cgt ttt tec gga gee tec cgt ggc etc tea 
Ala Leu He Thr Asp Gly Arg Phe Ser Gly Ala Ser Arg Gly Leu Ser 
465 470 475 480 

ate ggc cac gta tea cct gag gee get gag ggc <ggg <:cg ett gee ttt 
He Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly Pro Leu Ala Phe 
485 490 • 495 



gat gta caa gtg cca gaa gaa gag* tgg gaa aaa cga* -aaa gcg aae tgg 
Asp Val Gin- Val Pro Glu Glu -Glu Trp Glu Lys Arg Lys Ala Asn Trp 
515 520 525 



1392 



1440 



1483 



gtt gaa aac gga gac cat att ate gtt gat att gaa aaa cgc ate ttg 1536 
Val Glu Asn Gly Asp His He He Val Asp He Glu Lys Arg He Leu 
500 505 510 



1584 



aaa ggt ttt gaa ccg aaa gtg aaa acc ggc tac ctg gca cgt tat tct 1632 
Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser 
530 535 540 

aaa ctt gtg aca agt gee aac acc ggc ggt att atg aaa ate 1*674 
Lys Leu Val Thr Ser Ala Asn Thr Gly -Gly He Met Lys He 
545 550 555 



<210> 38 
<211> 558 
<212> PRT 

<213> Bacillus subtilis 
<400> 38 

Met Ala Glu Leu Arg Ser Asn Met He Thr <31n -Gly He Asp Arg Ala 
15 10 15 

Pro His Arg Ser Leu Leu Arg Ala Ala Gly Val Lys <31u Glu Asp Phe 
20 25 30 

Gly Lys Pro Phe He Ala Val Cys Asn Ser Tyr He Asp He Val Pro 
35 40 45 

Gly His Val His Leu Gin Glu Phe Gly Lys lie Val Lys <51u Ala He 
50 55 60 

Arg Glu Ala Gly Gly Val Pro Phe -Glu Phe Asn Thr He Gly Val Asp 
65 70 75 60 
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Asp Gly He Ala Met Gly His He Gly Met Arg Tyr Ser Leu Pro Ser 
85 90 " 95 

Arg Glu He He Ala Asp Ser Val Glu Thr Val Val Ser Ala His Trp 
100 105 HO 

Phe Asp Gly Met Val Cys He Pro Asn Cys Asp Lys He Thr Pro Gly 
115 120 ~ 125* 

Met Leu Met Ala Ala Met Arg He Asn He Pro Thr He Phe Val Ser 
130 135 140 

Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr Gly Arg Lys He Ser 
I 45 150 155 160 

Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr Gin Ala Gly Lys He 
165 170 175 

Asn Glu Asn Glu Leu Gin Glu Leu Glu Gin Phe Gly Cys Pro Thr Cys 
180 185 190 

Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Ser 
195 200 205 

Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr He Leu Ala Thr 
210 215 220 

Ser Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gin Leu Met 
225 230 235 240 

Glu Thr He Arg Lys Asp He Lys Pro Arg Asp He Val Thr Val Lys 
245 250 255 

Ala He Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr 
260 265 270 

Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu 
275 280 285 

Tyr Ser Leu Glu Arg He Asn Glu Val Ala Glu Arg Val Pro His Leu 
290 295 300 



Ala Lys Leu Ala Pro Ala Ser Asp Val Phe He Glu Asp Leu His Glu 

305 310 33 5 320 

Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly 

325 330 ~ 335 

Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu 

340 345 350 

Thr He Ala Gly His Glu Val Lys Asp Tyr Asp Val He His Pro Leu 

355 360 365 

Asp Gin Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn 

370 375 380 
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Leu Ala Pro Asp Gly Ala -lie lie Lys Thr Gly JGly Val Gin Asn Gly 
385 390 395 400 

lie Thr Arg His Glu «Gly Pro Ala Val Val Phe Asp Ser Gin Asp Glu 
405 ~ 410 415 

Ala Leu Asp Gly He He Asn Arg Lys Val Lys Glu Gly Asp Val Val 
420 425 430 

He He Arg Tyr -Glu Gly Pro Lys -Gly Gly Pro Gly Met Pr6 -Glu Met 
435 440 445 

Leu Ala Pro Thr Ser Gin He Val Gly Met Gly Leu Gly Pro Lys Val 
450 455 4£0 

Ala Leu He Thr Asp Gly Arg Phe Ser <Gly Ala ^Ser Arg Gly Leu Ser 
465 470 475 ( 480 

He Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly Pro Leu Ala Phe 
485 490 495 

Val Glu Asn Gly Asp His lie lie Val Asp He -Glu Lys Arg lie Leu 
500 505 510 

Asp Val Gin Val Pro Glu Glu Glu Trp Glu Lys Arg Lys Ala Asn Trp 

515 1 • 520 525 

Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser 
530 535 540 

Lys Leu Val Thr Ser Ala Asn Thr -Gly -Gly He Met Lys He 
545 550 555 



<210> 39 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
' <220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (136). .{141) 

<220> 

<221> -10_signal 
<222> (159).. (164) 

<400> 39 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 
tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120 
ttttacaaaa aggtattgac tttccctaca -gggtgtgtaa taatttaatt acaggcgggg 180 
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gcaaccccgc ctgt 194 



<210> 40 
<211> 163 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (113) . . (118) 

<220> 

<221> -10_signal 
<222> (136) (141) 

<400> 40 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 
tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 
atctacaagg tgtggtataa taatcttaac aacagcagga cgc 163 



<210> 41 
<211> 127 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: promoter 
sequence 

<220> 

<221> -35_signal 
<222> (34) . . (39) 

<220> 

<221> -10_signal 
<222> (58).. (63) 

<220> 

<221> -35_signal 
<222> (75) . . (80) 

<220> 

<221> -10_signal 
<222> (98).. (103) 

<400> 41 

gaggaatcat agaattttgt caaaataatt ttattgacaa cgtcttatta acgttgatat 60 
aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120 
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tgcaatg 



<210> 42 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribo some 
binding site 

<400> 42 

taaacatgag gaggagaaaa catg 

<210> 43 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribo some 
binding site 



<400> 43 

attcgagaaa tggagagaat ataatatg 

<210> 44 
<211> 13 
<212> DNA 

<213> Artificial Sequence 



<2B 



<220> 

<223> Description of Artificial Sequence : ribo some 
binding site 

<400> 44 
agaaaggagg tga 

<210> 45 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 45 

ttaagaaagg aggtgannnn atg 

<210> 46 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
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binding site 
<400> 46 

ttagaaagga ggtgannnnn atg 



<210> 47 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence rribosome 
binding site 

<400> 47 

agaaaggagg tgannnnnnn atg 

<210> 48 
<211> 22 

<212> DNA i 
• <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence rribosome 
binding site 

<400> 48 

agaaaggagg tgannnnnna tg 

<2ia> 49 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence rribosome 
binding site 

<400> 49 

ccctctagaa ggaggagaaa acatg 

<210> 50 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence rribosome 
binding site 

<400> 50 

ccctctagag .gaggagaaaa catg 

<210> 51 
<211> 23 
<212> DNA 
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23 



23 



22 



25 



WO 01/021772 



PCT/US00/25993 



-52- 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :ribosome 
binding site 

<400> 51 

ttagaaagga ggatttaaat atg 

<210> 52 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 52 

ttagaaagga ggtttaatta atg 

<210> 53 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

< . 

<220> 

<223>' Description of Artificial Sequence: ribosome 
binding site 

<400> 53 

ttagaaagga ggtgatttaa atg 

<210> 54 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<400> 54 

ttagaaagga ggtgtttaaa atg 

<210> 55 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 55 

attcgagaaa ggaggtgaat ataatatg 



<210> 56 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence iribosome 
binding site 

<400> 56 

attcgagaaa ggaggtgaat aataatg 27 

<210> 57 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribo some 
binding site 

<400> 57 

attcgtagaa aggaggtgaa ttaatatg i 28 



<210> 58 
<211> 3291 
<212> DNA 

<213> Bacillus subtilis 



<400> 58 
atggggacta 


atgtacaggt 


ggattcagca tctgccgaat 


gtacacagac 


gatgagcgga 


60 


gcattaatgc 


tgattgaatc 


attaaaaaaa gagaaagtag 


aaatgatctt 


cggttatccg 


120 


ggcggggctg 


tgcttccgat 


ttacgataag ctatacaatt 


cagggttggt 


acatatcctt 


180 


ccccgtcacg 


aacaaggagc 


aattcatgca gcggagggat 


acgcaagggt 


ctccggaaaa 


240 


ccgggtgtcg 


tcattgccac 


gtcagggccg ggagcgacaa 


accttgttac 


aggccttgct 


300 


gatgccatga 


ttgattcatt 


gccgttagtc gtctttacag 


ggcaggtagc 


aacctctgta 


360 


atcgggagcg 


atgcatttca 


ggaagcagac attttaggga 


ttacgatgcc 


agtaacaaaa 


420 


cacagctacc 


aggttcgcca 


gccggaagat ctgccgcgca 


tcattaaaga 


agcgttccat 


480 


attgcaacaa 


ctggaagacc 


cggacctgta ttgattgata 


ttccgaaaga 


tgtagcaaca 


540 


attgaaggag 


aattcagcta 


cgatcatgag atgaatctcc 


cgggatacca 


gccgacaaca 


600 


gagccgaatt 


atttgcagat 


ccgcaagctt gtggaagccg 


tgagcagtgc 


gaaaaaaccg 


660 


gtgatcctgg 


cgggtgcggg 


cgtactgcac ggaaaagcgt 


cagaagaatt 


aaaaaattat 


720 


gctgaacagc 


agcaaatccc 


tgtggcacac acccttttgg 


ggctcggagg 


cttcccggct 


780 


gaccatccgc 


ttttcctagg 


gatggcggga atgcacggta 


cttatacagc 


caatatggcc 


840 



WO 01/021772 PCT/USOO/25993 

-34- 

cttcatgaat gtgatctatt aatcagtatc ggcgcccgtt tt^atgaccg tgtcacagga 900 
aacctgaaac actttgccag aaacgcaaag atagcccaca tcgatattga tccagctgaa 950, 
atcggaaaaa tcatgaaaac acagattcct gtagtcggag acagcaaaat tgtcctgcag 1020 
gagctgatca aacaagacgg caaacaaagc gattcaagcg aatggaaaaa acagctcgca 1080 
gaatggaaag aagagtatcc gctctggtat gtagataatg aagaagaagg ttttaaacct 1140 
cagaaattga ttgaatatat tcat<:aattt acaaaaggag aggccattgt cgcaacggat 1200 
gtaggccagc atcaaatgtg gtcagcgcaa ttttatccgt tccaaaaagc agataaatgg 1260 
gtcacgtcag gcggacttgg aacgatggga tt-cggtcttc cggcggcgat cggcgcacag 1320 
ctggccgaaa aagatgctac tgttgtcgcg gttgtcggag acggcggatt ccaaatgacg 1380 
cttcaagaac tcgatgttat tcgcgaatta aatcttccgg 'tcaaggtagt -gattttaaat 1440 
aacgcttgtc tcggaatggt cagacagtgg caggaaattt tctatgaaga acgttattca 1500 
gaatctaaat tcgcttctca gcctgacttc -gtcaaattgt ccgaagcata cggcattaaa 1560 
ggcatcagaa tttcatcaga agciggaagca aaggaaaagc tggaagaggc attaacabca 1620 
agagaacctg ttgtcattga cgtgcgggtt gccagcgaag aaaaagtatt cccgatggtg 1680 
gctccgggga aagggctgca tgaaatggtg ggggtgaaac cttgaaaaga attatcacat 17 4-0 
tgactgtggt gaaccgctcc ggggtgttaa accggat<cac -cggtctattc acaaaaaggc 1800 
attacaacat tgaaagcatt acagttggac acacagaaac agccggcgtt tccagaatca 1360 
ccttcgtcgt tcatgttgaa ggtgaaaatg atgttgaaca gttaacgaaa cagctcaaca 1920 
aacagattga tgtgctgaaa gtcacagaca tcacaaatca atcgattgtc cagagggagc 1980 
tggccttaat caaggttgtc tccgcacctt caacaagaac agagattaat ggaatcatag 2040 
aaccgtttag agcctctgtc gttgatgtca gcagagacag catcgttgtt ^caggtgacag 2100 
gtgaatctaa caaaattgaa gcgcttattg agttattaaa aecttatggc attaaagaaa 2160 
tcgcgagaac aggtacaacg gcttttgcga ggggaaccag caaaaggcgt catccaataa 2220 
aacaatatct attgtataaa acataacaag ggagagattg aaatggtaaa agtatattat 2280 
aacggtgata tcaaagagaa cgtattggct ggaaaaacag tagcggttat cgggtacggt 2340 
tcgcaaggcc acgcacatgc cctgaacctt aaagaaagcg gagtagacgt gat-cgtc^gt 24t)0 
gttagacaag gaaaatcttt cactcaagcc caagaagacg gacataaagt attttcagta 24-60 
aaagaagcgg cagcccaagc cgaaatcatc atggttctgc ttecggatga gcagcagcaa 2520 
aaagtatacg aagctgaaat caaagatgaa ttgacagcag gaaaatoatt agtattcgct 2580 
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catggattta acgtgcattt ccatcaaatt gttcctccgg cggatgtaga tgtattctta 2640 
gtggccccta aaggcccggg acacttggta agaagaacat atgagcaagg agctggcgta 2700 
cctgcattgt tcgcaatcta tcaagatgtg actggagaag caagagacaa agccctcgct 2760 
tatgctaaag gaatcggcgg cgcaagagcg ggcgtattag aaacgacatt taaagaagaa 2820 
acagaaacag atttgttcgg tgagcaagca gttctttgcg gcggattaag egcgcttgtc 2880 
aaagccggat ttgaaacctt aactgaagca ggttatcagc ctgaacttgc atacttcgag 2940 
tgtcttcatg agctgaaatt aatcgtagac cttatgtacg aagaaggact tgcaggaatg 3000 
. agatattcaa tctctgacac agcacagtgg ggagatttcg tatcaggccc tcgcgttgtg 3060 
gacgccaaag taaaagaatc tatgaaagaa gtattaaaag atatccaaaa cggtacattc 3120 
gcaaaagagt ggatcgtcga aaaccaagta aaccgtcctc gtttcaacgc tatcaatgca 3180 
agcgagaacg aacatcaaat cgaagtagtg ggaagaaagc ttcgtgaaat gatgccgttt 3240 
gtgaaacaag gcaagaagaa ggaagcggtg gtctccgttg cgcaaaatta a 3291 



<210> 59 
<211> 2363 
<212> DNA 

<213> Bacillus subtilis 

<220> 
<221> CDS 

<222> (242) . . (1072) 

<220> • 
<221> CDS 

<222> (1077) . . (1934) 

<220> 
<221> CDS 

<222> (1939) . . (2319) 
<400> 59 

ttggtacaag cccgttgatt ttggtatact tccattgggc agtatcgect gcgaactgca 60 

cctattatta aaatagatag acattgcagc agtctgcctt gatccaaaaa aggactggga 120 

cagagggatg aaactcgccg aactttagaa agtgaagaat ccttctcgtt gtaacggaag 180 

gttttttggc ttgcagaaga aaacggcaga tcatctcctc taaacatgag gaggagaaaa 240 

c atg aaa aca aaa ctg gat ttt eta aaa atg aag gag tct gaa gaa ccg 289 
Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 
1 5 10 15 

att gtc atg ctg acc get tat gat tat ccg gca get aaa ctt get gaa 337 
lie Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 
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20 • 25 30 

caa gcg gga gtt -gac atg att tta gtc ggt gat tea ctt gga atg gtc 385 

•Gin Ala Gly Val Asp Met 11b Leu Val Gly Asp Ser Leu Gly Met Val 

35 40 45 

gtc etc ggc ctt gat tea act gtc ggt gtg aca gtt gcg gac atg ate 4 33 

Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met He 

50 55 SO 



cat cat aca aaa gec gtt aaa agg ggt -gcg ccg aat acc ttt att gtg 
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe He Val 
65 70 75 80 



ggc gta ctg ggc ggc tat aaa gta cag ggc aaa gac gaa caa age gee 
Gly Val Leu Gly Gly Tyr Lys Val Gin Gly Lys Asp <31u Gin Ser Ala 
145 ^ 150 155 160 



gaa aca gca ate age gga tat gtt cag gat gta aga cat cgt get ttc 
Glu Thr Ala He Ser Gly Tyr Val Gin Asp Val Arg His Arg Ala Phe 
245 250 255 



481 



aca gat atg ccg ttt atg tct tat eac ctg tct aag gaa gat acg ctg 529 
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 
85 90 ( 95 

aaa aat gca gcg get ate gtt -cag gaa age gga get gac gca ctg aag 577 
Lys Asn Ala Ala Ala He Val Gin Glu Ser -Gly Ala Asp Ala Leu Lys 
100 105 110 

ctt gag ggc gga gaa ggc gtg ttt gaa tec att cgc gca ttg acg ctt 625 
Leu Glu Gly Gly Glu Gly Val Phe Glu Ser lie Arg Ala Leu Thr Leu 
115 ~ 120 125 

gga ggc att cca gta gtc agt eac tta ggt ttg aca ccg -cag tea gtc 673 
Gly Gly He Pro Val Val Ser His Leu Gly Leu Thr Pro Gin Ser Val 
130 135 140 



721 



aaa aaa tta ata gaa gac agt ata aaa tgc gaa gaa gca gga get atg 769 
Lys Lys Leu He Glu Asp Ser He Lys Cys Glu <Slu Ala Gly Ala Met 
165 * 170 175 

atg ctt gtg ctg gaa tgt gtg -ccg gca gaa etc aca gee aaa att gee 817 
Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys He Ala 
180 185 190 

gag acg eta age ata ecg gtc att gga ate ggg get ggt gtg aaa gcg 865 
Glu Thr Leu Ser He Pro Val He Gly He <31y Ala Gly Val Lys Ala 
195 200 205 

gac gga caa gtt etc gtt tat cat gat att ate ggc eac ggt gtt gag 913 
Asp Gly Gin Val Leu Val Tyr His Asp He He Gly His Gly Val Glu 
210 215 220 

aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa acc ate 961 
Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg He Asp Glu Thr He 
225 230 235 240 



1009 
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cct gaa caa aag cat tec ttt caa atg aac cag aca gtg ctt gac ggc 1057 
Pro <31u Gin Lys His Ser Phe Gin Met Asn Gin Thr Val Leu Asp Gly 
260 265 ' 270 

ttg tac ggg gga aaa taag atg aga cag att act gat att tea cag ctg 1106 
, Leu Tyr Gly Gly Lys Met Arg Gin He Thr Asp He Ser Gin Leu 

275 280 285 

aaa gaa gec ata aaa caa tac cat tea gag ggc aag tea ate gga ttt 1154 
Lys Glu Ala He Lys Gin Tyr His Ser Glu Gly Lys Ser He Gly Phe 
290 295 300 

gtt ccg acg atg ggg ttt ctg cat gag ggg cat tta acc tta gca gac 1202 
Val Pro Thr Met Gly Phe Leu His Glu Gly His Leu Thr Leu Ala Asp 
305 310 315 

aaa gca aga caa gaa aac gac gee gtt att atg agt att ttt gtg aat 1250 
Lys Ala Arg Gin Glu Asn Asp Ala Val He Met Ser He Phe Val Asn 
320 325 330 335 

cct gca caa ttc ggc ect aat gaa gat ttt gaa gca tat ccg cgc gat 1298 
Pro Ala Gin Phe Gly Pro Asn Glu Asp Phe Glu Ala 'Tyr Pro Arg Asp 
340 345 350 

att gag egg gat gca get ctt gca gaa aac gec gga gtc gat att ctt 134 6 
He Glu Arg Asp Ala Ala Leu Ala Glu Asn Ala Gly Val Asp He Leu 
355 360 365 

ttt acg- cca gat get cat gat atg tat ccc ggt gaa aag aat gtc acg 1394 
Phe Thr Pro Asp Ala His Asp Met Tyr Pro Gly Glu Lys Asn Val Thr 
370 375 380 

att cat gta gaa aga cgc aca gac gtg tta tgc ggg cgc tea aga gaa 1442 
He His Val Glu Arg Arg Thr Asp Val Leu Cys Gly Arg Ser Arg <31u 
385 390 395 

gga cat ttt gac ggg gtc gcg ate gta ctg acg aag ctt ttc aat eta ' 14 90 
. Gly His Phe Asp Gly Val Ala He Val Leu Thr Lys Leu Phe Asn Leu 
400 405 410 415 

gtc aag ccg act cgt gee tat ttc ggt tta aaa gat gcg cag cag gta 1538 
Val Lys Pro Thr Arg Ala Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val 
420 425 430 

get gtt gtt gat ggg tta ate age gac ttc ttc atg gat att gaa ttg 1586 
Ala Val Val Asp Gly Leu He Ser Asp Phe Phe Met Asp He Glu Leu 
435 440 445 

gtt cct gtc gat acg gtc aga gag gaa gac ggc tta gec aaa age tct 1634 
Val Pro Val Asp Thr Val Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser 
450 455 460 

cgc aat gta tac tta aca get gag gaa aga aaa gaa gcg cct aag ctg 1682 
Arg Asn Val Tyr Leu Thr Ala Glu <Slu Arg Lys Glu Ala Pro Lys Leu 
465 470 475 

tat egg gee ctt caa aca agt gcg gaa ctt gtc caa gec ggt gaa aga 1730 
Tyr Arg Ala Leu Gin Thr Ser Ala Glu Leu Val Gin Ala Gly Glu Arg 
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480 485 490 495 

gat cct gaa gcg gtg ata aaa get gca aaa gat ate att gaa acg aet 1778 
Asp Pro Glu Ala Val He Lys Ala Ala Lys Asp He He Glu Thr Thr 
1 500 505 510 



age gga acc ata gac tat gta gag <:tt tat tec tat ecg gaa , etc gag 
Ser Gly Thr He Asp Tyr Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu 
515 520 525 

cct gtg aat gaa att get gga aag atg att etc get gtt gca gtt get 
Pro Val Asn Glu He Ala Gly Lys Wet He Leu Ala Val Ala Val Ala 
530 535 540 

ttt tea aaa gcg cgt tta ata gat aat ate att att gat att cga gaa 
Phe Ser Lys Ala Arg Leu lie Asp Asn He He He Asp lie Arg Glu 
545 " 550 555 



agg gca act gtt acg gaa gca aac etg aac tat gtg gga age att aca 
Arg Ala Thr Val Thr Glu Ala Asn Leu Asn Tyr Val Gly Ser He Thr 
57 5 580 585 590 

att gat gaa gat etc att gat get gtg gga atg ctt cct aat gaa aaa 
He Asp Glu Asp Leu He Asp Ala Val Gly -Met Leu Pro Asn -Glu Lys 
595 600 605 



att cct ggt aaa egg gga age ggc gte ata tgc tta aac ggt gca gec 
He Pro Gly Lys Arg Gly Ser Gly Val He Cys Leu Asn tSly Ala Ala 
625 630 635 



1826 



1874 



1922 



atg gag aga ata taat atg tat ega aca atg atg age ggc aaa ctt -cac 1971 
Met Glu Arg He Met Tyr Arg Thr Met Met Ser <Sly Lys Leu His 

560 565 570 



2019 



2067 



gta caa att gtg aat aat aat aat gga gca cgt ctt gaa acg tat att 2115 
Val Gin He Val Asn Asn Asn Asn <31y Ala Arg Leu Glu Thr Tyr He 
610 . 615 620 



2163 



gca cgc ctt gtg cag gaa gga gat aag gte att att att tec tae aaa 2211 
Ala Arg Leu Val Gin Glu Gly Asp Lys Val He He He Ser Tyr Lys 
640 645 650 

atg atg tct gat caa gaa gcg gca age eat gag ccg aaa gtg get gtt 2259 
Met Met Ser Asp Gin Glu Ala Ala Ser His Glu Pro Lys Val Ala Val 
655 660 665 670 



ctg aat gat caa aac aaa att gaa caa atg ctg ggg aac gaa eea gee 
Leu Asn Asp Gin Asn Lys He <Slu Gin Met Leu <Sly Asn <Slu Pro Ala 
675 680 685 



2307 



cgt aca att ttg tagaagaaaa geccecttta tcgggggttt -tctt-ttaaga tttt 2363 
Arg Thr He Leu 
690 



<210> 60 
<211> 293 
<212> PRT 
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<213> Bacillus subtilis 
<400> 60 

Met Ser He Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu- Gly Leu 
1 ,5 10 15 

Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr He Tyr He Pro Leu 
20 25 30 

Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys 
35 40 45 

His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys He Pro Phe 
50 55 60 

He He Gly He Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala 
65 70 75 80 

Arg He Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val 
85 " 90 ' 95 

Ser Leu He Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys 
100 105 HO 

Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val 
115 120 125 

Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser 
130 135 140 

Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly 
145 150 155 " 160 

Val Phe Glu Val Val Glu Gin Ala Asp He Val He He Glu Gly He 
165 170 175 

Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg 
180 185 190 

He Phe Val Ser Asp Phe Phe Asp Phe Ser He Tyr Val Asp Ala Glu 
195 200 205 

Glu Ser Arg lie Phe Thr Trp Tyr Leu <31u Arg Phe Arg Leu Leu Arg 
210 215 220 

Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp 
225 230 * 235 240 

Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser He Trp Glu Ser 
245 250 255 

Val Asn Arg Pro Asn Leu Tyr Glu Asn He Leu Pro Thr Lys Phe Arg 
260 265 270 

Ser Asp Leu lie Leu Arg Lys Gly Asp Gly His Lys Val Glu Glu Val 
275 280 285 



Leu Val Arg 



Arg Val 
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<:210> 61 
<211> 281 
<212> PRT 

<213> Bacillus subtilis i 
<400> 61 

Met Glu Gly Leu Asn Asp Tyr Leu Ser Val -Glu Glu Val Glu Thr lie 
1 "* 5 It) 15 

Tyr lie Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala 
20 25 30 

. Glu Arg Asn Lys His Val Asn Val Phe Leu Lys . His Pro His Ser Ala 
35 .40 4 5 

Lys He Pro Phe He He Gly He Ala <31y Ser Val Ala Val Gly Lys 
50 55 60 i 

Ser Thr Thr Ala Arg He Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp 
65 70 75 80 

•Arg Pro Lys Val Ser, Leu He Thr Thr Asp -Gly Phe Leu Phe Pro Thr 

85 ■ • ' ' 90 '■ ■ 95 ■ 

Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys -Gly Phe Pro -Glu 
100 105 H*0 

Ser Tyr Asp Val Lys Ala Leu Leu -Glu Phe Leu Asn Asp Leu Lys Ser 
115 120 125 

Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His ^Leu Thr Tyr Asp 
130 135 140 

Arg Glu Glu Gly Val Phe Glu Val Val -Glu Gin Ala Asp lie Val He 
145 ' 150 1*55 T60 

He Glu Gly He Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg 
165 170 . 175 

Glu Asn Pro Arg He Phe Val Ser Asp Phe -Phe Asp Phe Ser He Tyr 
180 185 190 

Val Asp Ala Glu Glu Ser Arg He Phe Thr Trp Tyr Leu *Glu Arg Phe 
195 200 205 

Arg Leu Leu Arg Glu Thr Ala Phe <31n Asn Pro Asp Ser Tyr Phe His 
210 ^ 215 220 

Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser 
225 230 235 240 

He Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr tSlu Asn He Leu Pro 
245 250 255 

Thr Lys Phe Arg Ser Asp Leu He Leu Arg Lys Gly Asp <51y His Lys 
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260 265 270 



Val Glu Glu Val Leu Val Arg Arg Val 
275 280 



<210> 62 
<211> 1092 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (1089) 



<400> 62 

atg act aaa caa aca att cgc gtt gaa ttg aca tea aca aaa aaa ccg 

Met Thr Lys Gin Thr He Arg Val Glu Leu Thr Ser Thr Lys Lys Pro 
15 10 15 



48 



aaa cca gac cca aat cag ctt teg ttc gga aga gtg ttt aca gac cac 96 
Lys Pro Asp Pro Asn Gin Leu Ser Phe Gly Arg Val Phe Thr Asp His 
20 25 30 

atg ttt gta atg gac tat gec gca gat aaa ggt tgg tac gat cca aga 14 4 
Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg 
35 40 45 

ate att cct tat caa ccc tta tea atg gat cca act gca atg gtc tat 192 
He He Pro Tyr Gin Pro Leu Ser Met Asp Pro Thr Ala Met Val Tyr 
50 55 * 60 

cac tac ggc caa acc gtg ttt gaa ggg tta aag get tac gtg tea gag 240 
His Tyr Gly Gin Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu 
65 70 75 J 80 

gat gac cat gtt ctg ctt ttc aga ccg gaa aaa aat atg gaa cgc ctg 288 
Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu 
85 90 95 

aat caa tea aac gac cgc etc tgc ate ccg caa att gat gaa gaa cag 336 
Asn Gin Ser Asn Asp Arg Leu £ys n e Pro Gin He Asp Glu Glu Gin 
100 105 no 

gtt ctt gaa ggc tta aag cag ctt gtc gca att gat aaa gac tgg att 384 
Val Leu Glu Gly Leu Lys Gin Leu Val Ala lie Asp Lys Asp Trp He 
115 120 125 

cca aat gcg gag ggc acg tec ctt tac ate cgt ccg ttc ate ate gca 432 
Pro Asn Ala Glu Gly Thr Ser Leu Tyr He Arg Pro Phe He He Ala 
130 135 140 

acc gag cct ttc ctt ggt gtt gcg gca tct cat acg tat aag etc ttg 480 
Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu 
145 150 155 160 

ate att ctt tct ccg gtc ggc tct tat tac aaa gaa ggc att aag ccg 528 
He He Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly He Lys Pro 
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165 170 175 

gtc aaa ate get gtt gaa agt ^gaa ttt gtc cgt gcg gta aaa ggc gga 576 
yal Lys He Ala Val Glu Ser Glu Phe Val Ar^ Ala Val Lys Gly <Sly 
180 1S5 190 

aca gga aat gec aaa ace gca gga aae tat get tea age tta ,aaa ^cg 624 
Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala 
195 200 205 



cag cag gta gee gaa gag aaa gga ttt tet caa gta -etc tgg ctg gac 
Gin Gin Val Ala Glu Glu Lys Gly Phe Ser Gin Val Leu Trp Leu Asp 
210 215 220 

ggc att gag aag aaa tac ate gaa gaa gtc gga age atg aae ate ttc 
Gly He Glu Lys Lys Tyr He Glu Glu Val <Gly Ser Met Asn lie Phe 
225 230 235 240 

ttc aaa ate aac ggt gaa ate gta aca ccg atg ctg aac ggg age ate 
Phe Lys He Asn Gly Glu He Val Thr Pro Met Leu As*n Gly Ser He 
24 5 250 255 

ctg gaa ggc att acg cgc aat tea gtc ate gee *tg ett aag cat tgg 
Leu Glu Gly He Thr Arg Asn Ser Val He Ala Leu Leu Lys His Trp 
260 265 270 

ggc ctt caa gtt tea gaa cga aaa att gcg ate gat gag gtc ate caa 
Gly Leu Gin Val Ser Glu Arg Lys He Ala Il« Asp Glu Val He *Gln 
275 280 285 

gec cat aaa gac ggc ate ctg gaa gaa gee -ttc ^gga aca ^rgt aca gca 
Ala His Lys Asp Gly He Leu Glu -Glu Ala Phe Gly Thr Gly Thr Ala 
290 295 300 

get gtt att tec cca gtc ggc gag ctg ate tgg cag gat «gaa aca -ctt 
Ala Val He Ser Pro Val Gly Glu l*eu He Trp Gin Asp Glu Thr Leu 
305 310 315 320 

teg ate aac aac ggt gaa aca gga gaa ate gca aaa aaa eta tat -gac 
Ser He Asn Asn Gly Glu Thr Gly Glu He Ala Lys Lys Leu Tyr Asp 
325 330 335 

acg att aca ggc att caa aaa ggc get gtc gca gac gaa ttc gga -tgg 
Thr He Thr Gly He Gin Lys tSly Ala Val Ala Asp Glu Phe <31y Trp 
340 345 350 

acg ace gaa gtc gca gcg ctg act gaa age aag taa 
Thr Thr Glu Val Ala Ala Leu Thr -Glu Ser Lys 
355 360 



<210> 63 
<211> 363 
<212> PRT 

<213> Bacillus subtilis 



672 



720 



768 



81* 



864 



912' 



960 



1008 



1056 



1092 



<400> 63 

Met Thr Lys Gin Thr He Arg Val -Glu Leu Thr Ser Thr Lys Lys Pro 
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10 



15 



Lys Pro Asp -Pro Asn Gin Leu Ser Phe Gly Arg Val Phe Thr Asp His 
20 25 30 

Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg 
35 40 45 

He He Pro Tyr Gin Pro Leu Ser Met Asp Pro Thr Ala Met Val Tyr 
50 55 60 

His Tyr Gly Gin Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu 
65 70 75 80 

Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu 
85 90 95 

Asn Gin Ser Asn Asp Arg Leu Cys He Pro Gin He Asp Glu Glu Gin 
100 105 ^ no 

Val Leu Glu Gly Leu Lys Gin Leu Val Ala He Asp Lys Asp Trp He 
115 120 125 

Pro Asn Ala Glu Gly Thr Ser Leu Tyr He Arg Pro Phe lie lie Ala 
130 135 140 

Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu 
145 150 155 160 

He He Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly lie Lys Pro 
165 170 175 

Val Lys He Ala Val Glu Ser Glu Phe Val Arg Ala Val Lys Gly Gly 
180 185 190 

Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala 
195 200 205 

Gin Gin Val Ala Glu Glu Lys Gly Phe Ser Gin Val Leu Trp Leu Asp 
210 215 - 220 

Gly He Glu Lys Lys Tyr lie Glu Glu Val Gly Ser Met Asn lie Phe 
225 230 235 240 

Phe Lys He Asn Gly Glu lie Val Thr Pro Met Leu Asn Gly Ser He 
245 250 255 

Leu Glu Gly He Thr Arg Asn Ser Val lie Ala Leu Leu Lys His Trp 
260 265 270 

Gly Leu Gin Val Ser Glu Arg Lys He Ala lie Asp Glu Val lie Gin 
275 280 285 

Ala His Lys Asp Gly He Leu Glu Glu Ala Phe Gly Thr Gly Thr Ala 
2 90 295 300 



Ala Val He Ser Pro Val Gly Glu Leu He Trp Gin Asp Glu Thr Leu 
305 310 315 . 320 
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Ser He Asn Asn X31y Glu Thr Gly Glu He Ala Lys Lys Leu Tyr Asp 

325 330 335 

Thr He Thr Gly He Gin Lys Gly Ala Val Ala Asp Glu Phe -Gly Trp 
340 345 350 

t 

Thr Thr Glu Val Ala Ala Leu Thr Glu Ser Lys 
355 360 



<210> 64 
<211> 1071 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (1068) 

t 

,<400> 64 

ttg aat aag ctt att gaa cga gaa aaa act gta tat tat aag gaa aag 
Met Asn Lys Leu He Glu Arg Glu Lys Thr Val Tyr Tyr Lys Glu Lys 
15 10 15 

ccc gac ccg tct tec ttg ggg ttt' 'gga caa tat ttt aca' gat tat atg' 
Pro Asp Pro Ser Ser Leu Gly Phe ^Gly Gin Tyr Phe Thr Asp Tyr Met 
20 25 30 

ttt gtg atg gac tac gaa gag ggg att gga tqg cat cat ccg aga att 
Phe Val Met Asp Tyr Glu Glu Gly He <Gly Trp His His Pro Arg He 
35 40 45 



aag gaa aaa gga acg tea ctg tat att cgt cct ttt gte att gec aca 

Lys Glu Lys Gly Thr Ser Leu Tyr He Arg Pro Phe Val He Ala Thr 
130 135 140 

gaa ccg agt etc ggt gtg aag gca tec agg age tat aca ttt atg ate 



48 



96 



144 



gcg ccg tac gca ccg ctt acg ctt gat ccg tct tea tct gtt ttt cat 192 
Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His 
50 55 60 

tac ggc cag get gtt ttt gaa gga tta aaa gca tac aga aca gac gac 240 
Tyr Gly Gin Ala Val Phe Glu Gly Leu Lys Ala Tyr Arg Thr Asp Asp 
65 70 75 80 

ggc agg gtg ctg ctg ttc cgt ccg gat caa aat ate aaa egg <ctg aac 258 
Gly Arg Val Leu Leu Phe Arg Pro Asp 'Gin Asn He Lys Arg Leu Asn 
85 90 95 

aga teg tgt gag cgc atg age atg ccc cct tta gac gaa gag ctg gtg 336 
Arg Ser Cys Glu Arg Met Ser -Met Pro Pro Leu Asp <31u Glu Leu Val 

ioo 105 : HO 

ctt gag gca ttg acg caa tta gtt gag ctg gag aaa gat tgg gtt cca 3S4 
Leu Glu Ala Leu Thr Gin Leu Val Glu Leu Glu Lys Asp Trp Val Pro 
115 120 125 



432 



480 
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Glu Pro Ser Leu Gly Val Lys Ala Ser Arg Ser Tyr Thr Phe Met lie . 
145 150 155 160 

t 

gtg ctt teg cct gtc ggc tec tat tat ggc gac gat cag ctg« aag ccg 528 
Val Leu Ser Pro Yal Gly Ser Tyr Tyr Gly Asp Asp Gin Leu Lys Pro 
165 170 175 

gtt aga ate tat gtc gaa gat gag tat gtg agg gcg gtc aac gga gga 576 
Val Arg lie Tyr Val Glu Asp Glu Tyr Val Arg Ala Val* Asn Gly Gly 
180 185 190 

gtc ggg ttt gca aaa acg get gga aac tat gee gec agt ctt cag gca 624 
Val Gly Phe Ala Lys Thr Ala Gly Asn Tyr Ala Ala Ser Leu Gin Ala 
195 200 205 



cag egg aaa gcg aat gaa ctg ggc tat gac cag gta ctg tgg ctg gac 
Gin Arg Lys Ala Asn Glu Leu Gly Tyr Asp <Hn Val Leu Trp Leu Asp 
210 215 220 



672 



gee ate gaa aag aaa tat gtg gaa gaa gta ggg age atg aac ate ttt 720 

Ala He Glu Lys Lys Tyr Val Glu Glu Val Gly Ser Met Asn He Phe 
225 230 235 240 

ttc gtc ata aac ggg gaa get gtc aca cct get tta age gga age att 768 

Phe Val He Asn Gly Glu Ala Val Thr Pro Ala Leu Ser Gly Ser He 

245 250 255 

tta age ggg gtt aca cgt 1 gcg tct gcg att gaa ttg att cga age tgg 816 

Leu Ser Gly Val Thr Arg Ala Ser Ala He Glu Leu He Arg Ser Trp 

260 265 270 

ggc att ccg gtt cgt gaa gag aga ata teg att gat gag gtg tat gcg 864 

Gly He Pro Val Arg Glu Glu Arg He Ser He Asp Glu Val Tyr Ala 

275 280 285 

gee tct gca cgc gga gaa ttg aca gag gtc ttt ggc aca ggc acg gca 912 

Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala 

290 295 300 

gca gtc gtt acg cct gtc ggt gaa etc aac ate cat gga aaa acg gtg 960 

Ala Val Val Thr Pro Val Gly Glu Leu Asn He His Gly Lys Thr Val 

305 310 315 " 320 

att gta ggc gac ggg caa ate ggg gac etc teg aaa aag ctg tat gaa 1008 

He Val Gly Asp Gly Gin He Gly Asp Leu Ser Lys Lys Leu Tyr Glu 

325 330 ' 335 

acg ata aca gat att cag ctt ggc aag gta aaa ggc ccg ttt aac tgg 1056 

Thr He Thr Asp He Gin Leu Gly £ys Val Lys Gly Pro Phe Asn Trp 

340 345 350 

aca gtg gaa gtg tga 1071 
Thr Val Glu Val 
355 



<210> 65 
<211> 356 
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<212> PRT 

<213> Bacillus subtilis 
,<400> 65 

Met Asn Lys Leu lie Glu Arg Glu Lys Thr Val Tyr Tyr Lys <31u Lys 
15' 10 15 

i 

Pro Asp Pro Ser Ser Leu Gly Phe Gly Gin Tyr Phe Thr Asp Tyr Met 
20 25 30 

Phe Val Met Asp Tyr Glu Glu Gly lie *Gly Trp His His Pro Arg lie 
35 40 45 

Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His 
50 55 60 

Tyr Gly Gin Ala Val Phe Glu Gly Leu Lys Ala Tyr Arg Thr Asp Asp 
65 70 75 80 

Gly Arg Val Leu Leu Phe Arg Pro Asp Gin Asn lie L^s Arg Leu Asn 
85 90 95 

Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu Glu Leu Val 
100 105 HO 

Leu Glu Ala Leu Thr Gin Leu Val' Glu Leu Glu Lys* Asp Trp Val Pro 
115 120 125 

Lys Glu Lys Gly Thr Ser Leu Tyr lie Arg Pro Phe Val lie Ala Thr 
130 ' 135 140 

Glu Pro Ser Leu Gly Val Lys Ala Ser Arg Ser Tyr Thr Phe Met He 
14 5 150 155 160 

Val Leu Ser Pro Val Gly Ser Tyr Tyr Gly Asp Asp Gin Leu Lys Pro 
165 170 175 

Val Arg He Tyr Val Glu Asp Glu Tyr Val Arg Ala Val Asn Gly -Gly 
180 185 190 

Val Gly Phe Ala Lys Thr Ala -Gly Asn Tyr Ala Ala Ser Leu Gin Ala 
195 200 205 

Gin Arg Lys Ala Asn Glu Leu Gly Tyr Asp Gin Val Leu Trp Leu Asp 
210 215 220 

Ala He Glu Lys Lys Tyr Val Glu -Glu Val 'Gly Ser Met . Asn He Phe 
225 ' 230 235 240 

Phe Val lie Asn Gly Glu Ala Val Thr Pro Ala Leu Ser *Gly Ser He 
245 250 255 

Leu Ser Gly Val Thr Arg Ala Ser Ala He Clu Leu He Arg -Ser Trp 
260 265 270 

Gly He Pro Val Arg Glu Glu Arg -He Ser He Asp Glu Val Tyr Ala 
275 280 285 
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Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala ' 
290 295 300 

Ala Val Val Thr Pro Val Gly Glu Leu Asn He His Gly Lys* Thr Val 
305 . 310 315 320 

He Val Gly Asp Gly <31n He Gly Asp Leu Ser Lys Lys Leu Tyr Glu 
325 330 335 

Thr He Thr Asp He Gin Leu Gly Lys Val Lys Gly Pro Phe Asn Trp 
340 345 350 

Thr Val Glu Val 
355 



<210> 66 
<211> 1428 

<212> DNA , 
<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (1425) 

<400> 66 

atg tta aac ggc caa aaa gaa tat cgc gtg gaa aaa gac ttc ctt ggg 48 
Met Leu Asn Gly Gin Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly 
15 10 15 

gaa aaa caa att gaa gca gat gtt tat tac gga att cag acg etc cgt 96 
Glu Lys Gin He Glu Ala Asp Val Tyr Tyr Gly lie Gin Thr Leu Arg 
20 25 30 

get tct gaa aat ttt ccg ate aca gga tac aaa ate cat gag gaa atg 144 
Ala Ser Glu Asn Phe Pro He Thr Gly Tyr Lys He His Glu Glu Met 
35 40 45 

att aac gca ctg gcg att gtg aaa aaa get gcg get ctt gee aac atg 192 
He Asn Ala Leu Ala He Val Lys Lys Ala Ala Ala Leu Ala Asn Met 
50 55 60 

gac gtg aaa egg ctg tat gaa gga att ggc caa get ate gta caa gec 240 
Asp Val Lys Arg Leu Tyr Glu Gly He Gly Gin Ala He Val Gin Ala 
65 70 75 80 

get gac gag att ctg gaa ggc aag tgg cac gat cag ttt ate gtc gat 288 
Ala Asp Glu He Leu Glu Gly Lys Trp His Asp Gin Phe He Val Asp 
85 90 95 

ccg att cag ggc ggt gee gga act tct atg aac atg aac gcg aat gag 336 
Pro He Gin Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu 
100 105 110 

gtt ate gga aac egg gcg ctt gaa ate atg gga cat aaa aag gga gat 384 
Val He Gly Asn Arg Ala Leu Glu He Met Gly His Lys Lys Gly Asp 
115 120 125 
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tat ate cat tta agt cca'aac aca cat gtg aac atg tea cag tct cag 432 
Tyr lie His Leu Ser Pro Asn Thr His Val Asn Met Ser -Gin Ser Gin 
130 . 13S 140 



aac gat gtg ttc ccg act get ate cat att tec aca ttg aag etc tta 
Asn Asp Val Phe Pro Thr Ala lie His He Ser Thr Leu Lys Leu Leu 
145 150 155 160 

gaa aaa ctg ctg aaa aca atg gaa gat atg cat agt gtg ttt aaa caa 
Glu Lys Leu Leu Lys Thr Met Glu Asp Wet His Ser Val Phe Lys Gin 
165 170 175 



460 



526 



aaa gca cag gag ttt cac tct gtt att aaa atg ggc egg aca eac ett 576 
Lys Ala Gin Glu Phe His Ser Val He Lys Met Gly Arg Thr His Leu 
180 185 190 

caa gat gcg gtt ccg ate cgt ett ggc cag gaa ttc gaa* get tac age 624 
Gin Asp Ala Val Pro He Arg Leu <31y <Sln Glu Phe Glu Ala Tyr Ser 
195 200 205 

cgt gtt etc gag cgt gat ate aaa cga ate aag caa teg cgc cag cac 672 
Arg Val Leu Glu Arg Asp lie Lys Arg He Lys Gin Ser Arg Gin His 
210 215 220 



ctg tat gaa gtc aac atg ggc gca act get gtt ggt aca ggg ctg aac 
Leu Tyr Glu Val Asn Met Gly Ala Thr Ala Val Gly Thr «Gly Leu Asn 
225 230 235 240 



ccg cgc gec gga ett gcg gaa att tct ctg .cct gca cgt cag ccg ggt 
Pro Arg Ala Gly Leu Ala Glu He Ser Leu Pro Ala Arg Gin Pro Gly 
305 310 315 320 



720 



get gat cct gaa tat ate aaa cag gta gta aag eac ett get gat att 768 
Ala Asp Pro Glu Tyr He Lys <31n Val Val Lys His Leu Ala Asp He' 
24 5 250 255 

age ggg ett cct ett gtc ggc get gat cat ett gtt gat gcg aca caa 816 
Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gin 
260 -265 270 

aat aca gat gee tat aca gag gta tea get tea tta aaa gte tgc atg 8-64 
Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Wet 
275 280 285 

atg aac atg teg aag ate gca aac gae ctg egc tta atg gcg teg gga 912 
Met Asn Met Ser Lys He Ala Asn Asp Leu Arg Leu Met Ala Ser -Gly 
290 295 300 



960 



tea tct att atg ccg ggg aaa gtc aat ccg gtt atg gcg gag etg ate 1008 
Ser Ser He Met Pro Gly Lys Val Asn Pro Val Met Ala -Glu Leu He 
325 330 335 

aac caa att gcg ttc cag gtt ate gga aat gac aat aca ate tgc ett 1056 
Asn Gin He Ala Phe Gin Val lie 431y Asn Asp Asn Thr He Cys Leu 
340 345 350 

get tea gaa <fcc ggc eag ett gag ttg aac gtc atg gag eec gtg ett 1104 
Ala Ser <31u Ala Gly Gin Leu Glu Leu Asn Val Met <31u Pro Val Leu 
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355 360 365 

gtc ttt aat ttg ctt caa tec ate age ate atg aac aac ggc ttc cgt 1152 
Val Phe Asn Leu Leu Gin Ser He Ser He Met Asn Asn Gly Phe Aro 
370 375 380 

teg ttc act gac aac tgc tta aaa ggc att gaa gec aac gaa aag cgt 1200 
Ser Phe Thr Asp Asn Cys Leu Lys Gly He Glu Ala Asn Glu Lys Arc 
385 390 395 ' 4 00 

atg aag caa tac gta gaa aaa age gca ggc gtg ate aca get gtc aat 1248 
Met Lys Gin Tyr Val Glu Lys Ser Ala Gly Val He Thr Ala Val Asn 
405 410 415 

ccg cat ctt ggg tat gaa gcg gca get aga att gee agg gaa gca att i296 
Pro His Leu Gly Tyr Glu Ala Ala Ala Arg He Ala Arg Glu Ala He 
420 425 430 

atg aca ggg caa tct gtc egg gat ctt tgt ctg cag cat gat gtg ctg 1344 
Met Thr Gly Gin Ser Val Arg Asp Leu Cys Leu Gin His Asp Val Leu 
435 440 445 

act gaa gaa gaa ttg gat att att tta aac cca tat gag atg acc aaa 1392 
Thr Glu Glu Glu Leu Asp He lie Leu Asn Pro Tyr Glu Met Thr Lys 
450 455 460 - 

cca ggt ate gca ggg aaa gaa eta tta gaa aaa taa 1428 
Pro Gly He Ala Gly Lys Glu Leu Leu Glu Lys 
465 470 475 

<210> 67 
<211> 475 
<212> PRT 

<213> Bacillus subtilis 
<400> 67 

Met Leu Asn Gly Gin Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly 
15 10 15 

Glu Lys Gin He Glu Ala Asp Val Tyr Tyr Gly He Gin Thr Leu Arg 
20 25 30 

Ala Ser Glu Asn Phe Pro He Thr Gly Tyr Lys He His Glu Glu Met 
35 40 45 

He Asn Ala Leu Ala He Val Lys Lys Ala Ala Ala Leu Ala Asn Met 
50 55 60 

Asp Val Lys Arg Leu Tyr Glu Gly He Gly Gin Ala He Val -Gin Ala 
65 7 0 75 80 

Ala Asp Glu He Leu Glu Gly Lys Trp His Asp Gin Phe He Val Asp 
85 90 95 

Pro He Gin Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu 
100 105 no 
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Val He Gly Asn Arg Ala -Leu Glu He Met Gly His Lys Lys Gly Asp 
115 120 125 

Tyr He His Leu Ser Pro Asn Thr His Val Asn Met Ser Gin *Ser *Gln 
130 135 140 

Asn Asp Val Phe Pro Thr Ala He His He -Ser Thr Leu Lys Leu Leu 
145 150 155 160 

Glu Lys Leu Leu Lys Thr Met Glu Asp Met His Ser Val Phe Lys Gin 
165 170 175 

Lys Ala Gin Glu Phe His Ser Val He Lys Met Gly Arg Thr His Leu 
180 185 190 

Gin Asp Ala Val Pro He Arg Leu <Sly Gin Glu Phe -Glu Ala Tyr Ser 
195 200 205, 

Arg Val Leu Glu Arg Asp He Lys Arg He Lys Gin Ser Arg Gin His 
210 215 220 

Leu Tyr Glu Val Asn Met Gly Ala Thr Ala Val ^Gly Thr Gly Leu Asn 
225 230 235 240 

Ala Asp Pro Glu Tyr He Lys Gin Val Val Lys His Leu Ala Asp lie 

245 ' -250 255 

Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gin 
260 265 270 

Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Met 
275 280 285 

Met Asn Met Ser Lys He Ala Asn Asp Leu Arg Leu Met Ala Ser Gl^ 
290 295 300 

Pro Arg Ala Gly Leu Ala Glu He Ser Leu Pro Ala Arg Gin Pro Gly 
305 310 315 320 

Ser Ser He Met Pro Gly Lys Val Asn Pro Val Met Ala Glu Leu He 
325 "* " 330 335 

Asn Gin He Ala Phe Gin Val He Gly Asn Asp Asn Thr He Cys Leu 
340 345 350 

Ala Ser Glu Ala Gly Gin Leu Glu Leu Asn Val Met Glu Pro Val Leu 
355 360 355 

Val Phe Asn Leu Leu Gin Ser He Ser He Met Asn Asn Gly Phe Arg 
370 375 380 

Ser Phe Thr Asp Asn Cys Leu Lys Gly He Glu Ala Asn Glu Lys Arg 
385 390 395 400 

Met Lys Gin Tyr Val Glu Lys Ser Ala Gly Val He Thr Ala Val Asn 
405 410 415 



Pro His Leu Gly Tyr Glu Ala Ala Ala Arg He Ala Ar^ Glu Ala lie 
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420 425 430 

Met Thr Gly Gin Ser Val Arg Asp Leu Cys Leu Gin His Asp Val Leu 
435 440 445 

Thr Glu Glu Glu Leu Asp He He Leu Asn Pro Tyr Glu Met Thr Lys 
450 455 460 

Pro Gly He Ala Gly Lys Glu Leu Leu Glu Lys , 
465 470 475 



<210> 68 
<211> 768 
<212> DNA 

<213> Bacillus subtilis 

<220> 
<221> CDS 
<222> (1) . . (765) 

<400> 68 » 
atg aaa cga gaa age aac att caa gtg etc age cgt ggt caa aaa gat 
Met Lys Arg Glu Ser Asn He Gin Val Leu Ser Arg Gly <31n Lys Asp 
1 5 io 15 

cag cct gtg age eag att tat caa gta tea aca atg act tct eta tta 
Gin Pro Val Ser Gin lie Tyr Gin Val Ser Thr Met Thr Ser Leu Leu 
20 25 30 



48 



96 



gac gga gta tat gac gga gat ttt gaa ctg tea gag att ccg aaa tat 14 4 
Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu He Pro Lys Tyr 
35 40 45 

gga gac ttc ggt ate gga ace ttt aac aag ctt gac gga gag ctg att 192 
Gly Asp Phe Gly He Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu He 
50 55 60 

ggg ttt gac ggc gaa ttt tac cgt ctt cgc tea gac gga ace gcg aca 240 
Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr 
65 70 75 80 

ccg gtc caa aat gga gae cgt tea ccg ttc tgt tea ttt acg ttc ttt 288 
Pro Val Gin Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe 
85 90 95 

aca ccg gac atg acg cac aaa att gat gcg aaa atg aca cgc gaa gac 336 
Thr Pro Asp Met Thr His Lys He Asp Ala Lys Met Thr Arg Glu Asp 
100 105 no 

ttt gaa aaa gag ate aac age atg ctg cca age aga aac tta ttt tat 384 
Phe Glu Lys Glu He Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr 
115 120 125 

gca att cgc att gac gga ttg ttt aaa aag gtg cag aca aga aca gta 432 
Ala He Arg He Asp Gly Leu Phe Lys Lys Val Gin Thr Arg Thr Val 
130 135 140 
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gaa ctt caa gaa aaa cct »tac -gtg cca atg gtt -gaa gcg gtc aaa aca 480 
Glu Leu Gin Glu Lys Pro Tyr Val Pro Wet Val Glu Ala Val Lys Thr 
145 150 155 160 

i 

cag ccg att ttc aac ttc gac aac gtg aga gga acg att gta ggt ttc 528 
Gin Pro lie Phe Asn Phe Asp Asn Val Arg <51y Thr lie Val Gly Phe 
165 170 175 

ttg aca cca get tat gca aac -gga ate gec gtt tct ggc tat eac ctg 576 
Leu Thr Pro Ala Tyr Ala Asn Gly lie Ala Val Ser Gly Tyr His Leu 
180 185 190 

cac ttc att gac gaa gga -cgc aat tea ggc gga cac gtt *ttt gac tat 624 
His Phe lie Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr 
195 200 205 

gtg ctt gag gat tgc acg gtt acg att tct caa aaa atg t aac atg aat 672 
Val Leu Glu Asp <:ys Thr Val Thr lie Ser T Gln Lys Met Asn Met Asn 
210 . 215 220 

etc aga ctt ccg aac aca gcg gat tte ttt aat gcg aat ctg gat aac 720 
Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn 
225 230 235 240 

cct gat ttt gcg aaa gat ate gaa aca act gaa gga age cct gaa taa 768 
Pro Asp Phe Ala Lys Asp He Glu Thr Thr Glu X31y Ser Pro Glu 
245 250 255 



<210> 69 
<211> 255 . 
<212> PRT 

<213> Bacillus subtilis 
<400> 69 

Met Lys Arg Glu Ser Asn lie Gin Val Leu Ser Arg Gly Gin Lys Asp 
15 10 15 

Gin Pro Val Ser Gin He Tyr Gin Val Ser Thr Met Thr Ser Leu Leu 
20 25 30 

Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser -Glu He Pro Lys Tyr 
35 40 45 

Gly Asp Phe Gly lie Gly Thr Phe Asn Lys Leu Asp ^Gly Glu Leu He 
50 55 60 

Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr 
65 " 70 75 80 

Pro Val Gin Asn Gly Asp Arg Ser Pro Phe -Cys Ser Phe Thr Phe Phe 
85 SO 95 

Thr Pro Asp Met Thr His Lys He Asp Ala Lys Met Thr Arg -Glu Asp 
100 105 110 

Phe Glu Lys Glu He Asn Ser Met Leu *ro Ser Arg Asn Leu -Phe Tyr 
115 120 125 
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Ala He Arg He Asp Gly Leu Phe Lys Lys Val Gin Thr Arg Thr Val 
130 135 140 

Glu Leu Gin Glu Lys Pro Tyr Val Pro Met Val Glu Ala Val Lys Thr 
» 145 150 155 160 

Gin Pro He Phe Asn Phe Asp Asn Val Arg Gly Thr He Val Gly Phe 
165 170 , 175 

Leu Thr Pro Ala Tyr Ala Asn Gly He Ala Val Ser Gly Tyr His Leu 
180 185 190 

His Phe He Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr 
195 200 205 

Val Leu Glu Asp Cys Thr Val Thr He Ser Gin Lys Met Asn Met Asn 
210 215 220 

Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn 
225 230 235 240 

i 

' Pro Asp Phe Ala Lys Asp He Glu Thr Thr Glu Gly Ser Pro Glu 
245 250 255 



<210> 70 
<211> 1254 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (1251) 

<400> 70 

atg aca ttc tec ctt ttt ggt gac aaa ttt acc cgc cac tec ggc att ' 4 
Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly He 
1 5 io 15 

acg ctg ttg atg gaa gat ctg aac gac ggt tta cgc acg cct ggc gcg 9 
Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala 
20 25 30 

att atg etc ggc ggc ggt aat ccg gcg cag ate ccg gaa atg cag gac 
He Met Leu Gly Gly Gly Asn Pro Ala Gin He Pro Glu Met Gin Asp 
35 40 45 

tac ttc eag acg eta ctg acc gac atg ctg gaa agt ggc aaa gcg act 
Tyr Phe Gin Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr 
50 55 60 

gat gca ctg tgt aac tac gac ggt cca cag ggg aaa acg gag eta etc ; 
Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gin Gly Lys Thr Glu Leu Leu 
65 70 75 80 

aca ctg ctt gee gga atg ctg cgc gag aag ttg ggt tgg gat ate gaa ; 
Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp lie Glu 
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85 1 90 95 

cca cag aat att gca eta aca aac ggc age cag age gcg tfct ttc tac 33^ 
Pro Gin Asn lie Ala Leu Thr Asn -Gly 'Ser Gin Ser -Ala Phe Phe Tyr 
100 105 lit) 



tta ttt aac ctg ttt gec gga cgc cgt gec gat ggt egg gtc aaa aaa 
Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys 
115 120 125 



ctg gaa gaa gat ctg ttt gtc tct gcg cgt ccg aat att gaa ctg ctg 
Leu Glu Glu Asp Leu Phe Val Ser Ala Arg Pro Asn lie <Slu Leu Leu 
145 * 150 1S5 , 160 

ccg gaa ggc cag ttt aaa tac cac gtc gat ttt gag cat ctg cat att 
Pro, Glu Gly Gin Phe Lys Tyr His Val Asp Phe <31u His Leu His lie 
165 170 175 



ggc aat gtg att act- gac gaa gag ttg ctg aag ctt gac gcg ctg ggc 
Gly Asn Val lie Thr Asp Glu Clu Leu Leu Lys Leu Asp Ala Leu Gly 
195 200 205 

aat caa cac ggc att ccg -ctg gtg att gat aac get tat ggc gtc ccg 
Asn Gin His <Gly He Pro Leu Val He Asp Asn Ala Tyr Gly Val Pro 
210 " 215 220 

ttc ccg ggt ate ate ttc agt gaa gcg cgc ccg eta tgg aat ccg aat 
Phe Pro Gly He He Phe Ser t^lu Ala Arg Pro Leu Trp Asn Pro Asn 
225 230 235 240 

ate gtg ctg tgc atg agt ctt tec aag -ctg ggt eta cct ggc tec cgc 
lie Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg 
245 250 -255 



atg aac ggc att ate age ctg gca cet ggc ggt att ggt ccg gcg atg 
Met Asn Gly He He Ser Leu Ala Pro -Gly Gly He Cly Pro Ala Met 
275 280 285 



gtc ate aaa ccg ttt tac tac eag cgt gtt cag gaa act ate gee ate 
Val He Lys Pro Phe Tyr Tyr Gin Arg Val Gin -Glu Thr lie Ala lie 
305 310 315 320 



384 



gtg ctg ttc ccg ctt gca ccg gaa tac att ggc tat get gac gec gga 4 32 
Val Leu Phe Pro Leu Ala Pro Glu Tyr lie Gly Tyr Ala Asp Ala Gly 
130 135 140 



480 



528 



ggc gaa gaa ace ggg atg att tgc gtc tec egg ccg acg aat cca aca 576 
Gly Glu Glu Thr Gly Met lie Cys Val Ser Arg Pro Thr Asn Pro Thr 
180 185 190 



624 



672 



720 



768 



tgc ggc att ate ate gec aat gaa aaa ate ate aec gee ate acc aat 31*6 
Cys Gly He He He Ala Asn Clu Lys He lie Thr Ala He Thr Ash 
260 265 270 



8*4 



atg tgt gaa atg att aag cgt aae gat ctg ctg cgc ctg tct gaa aca 912 
Met Cys Glu Met He Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr 
290 295 300 



960 
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cac taa 
His 



<210> 71 
<211> 417 
<212> PRT 

<213> Escherichia coli 
<400> 71 



Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly He 
15 10 15 

Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala 
20 25 30 

He Met Leu Gly Gly Gly Asn Pro Ala Gin He Pro Glu Met Gin Asp 
35 40 45 

Tyr Phe Gin Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr 
50 55 60 

Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gin Gly Lys Thr Glu Leu Leu 
65 70 75 80 

Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp He Glu 
85 90 95 

Pro Gin Asn He Ala Leu Thr Asn Gly Ser Gin Ser Ala Phe Phe Tyr 

100 105 no y 

Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys 
115 120 ~ 125 



1008 



1056 



-75- 

att cgc cgc tat tta ocg gaa aat cgc tgc ctg att cat aaa ccg gaa 

He Arg Arg Tyr Leu Pro Glu Asn Arg Cys Leu He His Lys Pro Glu 
325 330 335 

gga gcc att ttc etc tgg eta tgg ttt aag gat ttg ccc att acg acc 
' G1 y Ala Ile Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro He Thr Thr 
340 345 350 

aag cag etc tat cag cgc ctg aaa gca cgc ggc gtg ctg atg gtg ccg 1104 
Lys Gin Leu Tyr Gin Arg Leu Lys Ala Arg Gly Val Leu Met Val Pro 
355 360 365 

ggg cac aac ttc ttc cca ggg ctg gat aaa ccg tgg ccg cat acg cat 1152 
Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His 
370 375 380 

caa tgt atg cgc atg aac tac gta cca gag ccg gag aaa att gag gcg 1200 
Gin Cys Met Arg Met Asn Tyr Val Pro Glu Pro Glu Lys Ile Glu Ala 
385 3 ^0 395 400 

ggg gtg aag att ctg gcg gaa gag ata gaa aga gcc tgg get gaa agt 1248 
Gly Val Lys He Leu Ala Glu Glu Ile Glu Arg Ala Trp Ala Glu Ser 
405 410 415 



1254 
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Val Leu Phe Pro Leu Ala Pro Glu Tyr lie Gly Tyr Ala Asp Ala <31y 
130 135 140 

Leu Glu Glu Asp Leu Phe Val Ser Ala Arg -Pro Asn II* Glu Leu Leu 
145 150 155 1*0 

Pro Glu Gly Gin Phe Lys Tyr His Val Asp Phe Glu His Leu His lie 
165 170 175 

Gly Glu Glu Thr Gly Met ile <:ys Val Ser Arg Pro Thr Asn Pro Thr 
180 185 190 

Gly Asn Val Ile Thr Asp Glu Glu Leu Leu Lys Leu Asp Ala Leu Gly 
195 200 205 

Asn Gin His Gly Ile Pro Leu Val Ile Asp Asn Ala Tyr Gly Val Pro 
210 215 220 , 

Phe Pro Gly Ile Ile Phe Ser Glu Ala Arg Pro Leu Trp Asn Pro Asn 
225 J 230 235 240 

Ile Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg 
245 250 255 

Cys Gly He Ile Ile Ala Asn Glu Lys Ile lie Thr Ala Ile Thr Asn 
260 265 270 

Met Asn Gly Ile Ile Ser Leu Ala -Pro Gly -Gly lie *Gly Pro Ala Met 
275 280 285 

Met Cys Glu Met lie Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr 
290 295 300 

Val Ile Lys Pro Phe Tyr Tyr Gin Arg Val "Gin <51u Thr Ile Ala lie 
305 310 315 320 

Ile Arg Arg Tyr Leu Pro Glu Asn Arg Cys Leu lie His Lys Pro <31u 
325 330 335 

Gly Ala lie Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro Ile Thr Thr 
340 345 350 

Lys Gin Leu Tyr Gin Arg Leu Lys Ala Arg Gly Val Leu Met Val Pro 
355 360 3£5 

Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His 
370 375 380 

Gin Cys Met Arg Met Asn Tyr Val Pro *Glu Pro Glu Lys lie *Glu Ala 
385 390 395 400 

Gly Val Lys Ile Leu Ala Glu Glu Ile Glu Arg Ala Trp Ala Glu Ser 
405 410 415 



His 
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<210> 72 
<211> 8803 
<212> DNA 

<213> Artificial Sequence 
i <220> 

<223> Description of Artificial Sequence: Recombinant 
pAN294 plasmid 

<400> 72 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 

tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 

ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 240 

gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 300 
tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggrctgaag aagccatcac 360 

' gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420 

catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480 

gcatacttat gcgcggctga aggagcecat gtcctgctgt cggctagacg cgaggatcgt 540 

ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct €00 

ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660 

gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacattggat 720 

gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780 

cttccgcaaa tgcttgagca aaaaaaggga catatcatca atatcgcctc tcaagcgggg 840 

aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatgccgt gttaggttac 900 

tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960 

ggcccgattc agacggactt tttttccatt gctgataaag gcggggacta cgccaaaaat 1020 

gtcggccgct ggatgcttga tcctgatgac gtggcagctc aaattacagc tgcaattttt 1080 

acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa gctgtatcag 1140 

ctgtttccag ctcttgtaga aaagctggca ggacgcgcgc tcatgaaaaa ataatgatag 1200 

aactgcctgt ggtggagtgg cttgtttctc acggggcagt ttttgatagt ggaagggaga 1260 

gattgttgaa tgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320 

cagaaactga ttgatgcaga gcctgtattc tgggcaaatt caggcaagaa agaggggcca 1380 

ttaccccgtg cagatgagtg ggcaaccgag atagcggaag cggaaaaaag aatgcagcgg 14*40 
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tttgcacctt acattgccga ggt-gtttcct gagacgaaag gcgctaaagg aatcatcgag 1500 
tctccgcttt ttgaggtgca gcatatgaag ggaaagctgg aagcggcata fccagcagcca 15€0 
tttcccggaa gatggctttt aaagtgcgac catgagcttc ogatttcagg atcgattaaa 1620 
gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct tcaagaagga 1680 
atgcttcagg aaaccgatga ttaccgcatc ttacaggaag agcggtttac cgggtttttc 1740 
tcccgctatt cgattgctgt cggttcgaca ggaaatctag gtttaagcat -cggcatcatc. 1800 
ggcgcggcac tcgggtttcg cgtgacagtg catatgtccg ccgatgctaa gcagtggaaa I860 
aaggatctcc tccgccaaaa gggagtcact gttatggagt acgaaacaga ttacagtgaa 1920 
gcggtgaacg aagggagacg gcaggcggaa -caagatccat tctgttattt tattgatgat 1980 
gaacattctc gtcagctgtt cttaggatat gctgttgctg caagccgatt aaaaacacag 2040 
. cttgactgta tgaatataaa gccaagtctt gagacgccct tgtttgtgta tctgccgtgc 2100 
ggagtcggcg gaggaccggg cggtgtagca tttgggctga agcttttata <:ggagatgat 21*60 
• gttcatgtgt ttttcgcaga accaactcat tcaccttgta igctgttagg gctttattca 2220 
ggacttcacg agaagatctc cgtccaggat abcggcctgg ataatcagac ggctgctgac 2280 
ggacttgccg tagggaggcc gtcaggattt gt<:ggcaagc tgattgaacc gcttctgagc 2340 
ggctgttata cggtagagga caatacgctt tatactbtgc ttcatatgct ggctgtatct 2400 
gaagataaat atttagagcc ctctgctctt gctggcatgt tcgggccggt -tcagcttttt 2460 
tcgacagaag agggaaggcg ctatgctcag aaatataaga tggaacatgc <:gtacatgtc ^520 
. gtctggggaa cgggaggaag catggttcca aaagatgaaa tggctgcgta taaccgaatc 2580 
ggtgctgatt tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttttccgt 2640 
tacaatcttt gtaagggttt taaoctacag agagtcaggt gtaaacagtg aaaaataaag 2700 
aacttaacct acatacttta tatacacagc acaatcggga gtcttggfcct ggttttgggg -2760 
ggcatttgtc gattgctgta tctgaagaag aggcaaaagc tgtggaagga ttgaatgatt 2820 
atctatctgt tgaagaagtg gagacgatct atattccgct tgttcgcttg ctt-cafcttac 2880 
atgtcaagtc tgcggctgaa cgcaataagc atgtcaatgt ttttttgaag cacccacatt 2940 
cagccaaaat tccgtttatt atcggcattg <:cggcagtgt -cgcagtcgga aaaagcacga 3000 
cggcgcggat cttgcagaag ctgctttcgc gtttgcctga ccgtccaaaa gtgagcctta 3060 
tcacgacaga tggtttttta tttcctactg ccgagctgaa aaagaaaaat atgatgtcaa 3120 
gaaaaggatt tcctgaaagc tatgatgtaa aggcgctgct cgaatttttg aatgacttaa 3180 
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aatcaggaaa ggacagcgta aaggccccgg 
aaggtgtgtt cgaggttgta gaacaggcgg 
ttcagtcgcc caccttggag gatgaccggg 
ttgatttttc gatttatgtg gatgcggagg 
gttttcgcct gcttcgggaa acagcttttc 
aagacttgtc cgatcaggag gctgacgaga 
ggccgaattt atatgaaaat attttgccaa 
agggagacgg gcataaggtc gaggaagtgt 
tcgagcaata gttaccctta ttatcaagat 
aatcctttaa aaaaacacaa aagaccacat 
agcacccatt agttcaacaa acgaaaattg 
ttatgttaca gtaatattga cttttaaaaa 
gtaagcctcc taaattcact ttagataaaa 
aaaattgatt tagacaattg gaagagaaaa 
acgactttta gtataaccac agaaattgat 
gaaggatata aattttaccc tgcatttatt 
acagctttta gaactggtta caatagcgac 
ccactttata caatttttga tggtgtatct 
aagaatgact tcaaaga^tt ttatgattta 
tcggggaaat tgtttcccaa aacacctata 
ccatggactt catttactgg gtttaactta 
cccattatta cagcaggaaa attcattaat 
ttacaggtac atcattctgt ttgtgatggt 
caggaattgt cagataggcc taatgactgg 
ctttttacag tcggttttct aatgtcacta 
tattacagct gtcgactcgt gatcttcgga 
cagctgcgct tttcggtttt tcgcatactt 
ggcaaatata ataaatacaa acagctgaaa 
ctcatgtttg cgggagagat tcattctctt 
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tgtattccca 


tctaacctat gaccgcgagg 


3240 


atattgtgat 


tattgaaggc attaatgttc 


3300 


aaaacccgcg 


tatttttgtt tccgatttct 


3360 


aaagccggat 


tttcacttgg tatttagagc 


3420 


aaaatcctga 


ttcatatttt cataaattta 


3480 


tggcagcctc 


gatttgggag agtgtcaacc 


3540 


ctaaattcag 


gtcagatctc attttgcgta 


3600 


tggtaaggag 


ggtatgaaat gtgctgcagc 


3660 


aagaaagaaa 


aggatttttc gctacgctca 


3720 


tttttaatgt 


ggtctj:tatt cttcaactaa 


3780 


gataaagtgg 


gatattttta aaatatatat 


3840 


aggattgatt 


ctaatgaaga aagcagacaa 


3900 


atttaggagg 


catatcaaat gaactttaat 


3960 


gagatattta 


atcattattt gaaccaacaa 


4020 


attagtgttt 


tataccgaaa cataaaacaa 


4080 


ttcttagtga 


caagggtgat aaactcaaat 


4140 


ggagagttag 


gttattggga taagttagag 


4200 


aaaacattct 


ctggtatttg gactcctgta 


4260 


tacctttctg 


atgtagagaa atataatggt 


4320 


cctgaaaatg 


ctttttctct ttctattatt 


4380 


aatatcaata 


ataatagtaa ttaccttcta 


4440 


aaaggtaatt 


caatatattt accgctatct 


4500 


tatcatgcag 


gattgtttat gaactctatt 


4560 


cttttataat 


atgagataat gccgactgta 


4620 


acctgccccg 


ttagttgaag aaggttttta 


4680 


caggctgttc 


agctttttct caatgcgatc 


4740 


gaagcctgta 


acagccgcaa agacgacagc 


4800 


catcacatca 


cctatattca tgttcttcac 


4860 


ccgtttttta 


tttaaagcgg cttttccaga 


4920 
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cgggaacggt gttttgtggt ctccattttc atttgccgat aggcgaacgc taaaaatggc 4980 
aggccgagca gggtaatgcc gctcaggaca gaaaaaatat aaatcggccg gccagcgcca 5040 
' aacaggtcta tacatatccc cccgacccaa gggccgatga cgtttccgag -ctgtggaaaa 5100 
ccgattgccc cgaaataagt gccttttaat cctggttttg caatctggtc tacatacaaa 5160 
tccatcatag agaataaaag cacttcgccg attgtaaatg tgatgacaat -catcacaatt 5220 
gatggaacac cgtgtgatac ggtgaaaatg gccatgctga tgctaaccat <:acattaccg 5280 
agcatcagag aacaaagcgg cgaaaaccgt tttgcaaaat ggacaatggg aaattgcgtc 5340 
gccaacacaa cgattgcgtt taatgtcagc atcagcccat acagctt-cgt tccattgccg 5400 
atcaaggggt tctgcgccat atactgaggg aatgtggaac tgaattgtga gtagccgaag 5460 

gtgcatagcg taatgccgac caaagcaatg gtaaaaagat aatccttttg cgtgaccata 5520 

i 

, aacgcttccc gcacgctcat atttcgggac tgggctggtg ctgataagga tggatgtttt 5580 
ttaaattgga gggcaagcac aattccgtat agtccgtaaa -tgactgcagg caccaaaaag 564 0 

■ ggcgtagtcg attgcgatga gccgaaatat aggccaagca -caggtccgaa gacaacgccg 5700 
atattaatag ccgcatagcg taaattaaaa actagcagtc t-cgttttttc ttctgtcata 5760 
tcagacaaca aggcctttga agcgggctca aacagtgatt tgcaaagacc gtttaatgcg 5820 
tttactacaa aaaacaccca gagattagat gctgccgcaa agcctgcaaa -taecagcatc 5880 
catccgaaaa tcgatacaag catcatgttt tttctgccga atttatctga -gatatatccg 5940 
ccgtaaaagc ttgcgaggat gccgactgat gagctcgcgg cgatgaccag ccctgcatag -^000 
gaagctgatg cgccttggac ggctgtcaaa taaatcgcta aaaaaggaat gctcatcgat 6060 
gttgccattc tgccgaaaat ggttccgatt ataattgtac gcgtt-ggatg catagcttga 6120 
gtattctata gtgtcaccta aatagcttgg cgtaatcatg gtcatagctg -tttcctgtgt 6180 
gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag "6240 
cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt €300 
tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 6360 
gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgetg <:gctcggtcg 6420 
ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 6480 
caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 6540 
aaaaggccgc gttgctggcg tttttcgata ggctccgccc ccctgacgag catcacaaaa 6600 
atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 6660 
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cccctggaag 


ctccctcgtg 


cgctctcctg 


ttccgaccct 


gccgcttacc 


ggatacctg't 


6720 


ccgcctttct 


cccttcggga 


agcgtggcgc 


tttctcatag 


ctcacgctgt 


aggtatctca 


6780 


gttcggtgta 


ggtcgttcgc 


tccaagctgg 


gctgtgtgca 


cgaacccccc 


gttcagcccg 


6840 


accgctgcgc 


cttatccggt 


aactatcgtc 


ttgagtccaa 


cccggtaaga 


cacgacttat 


6900 


cgccactggc 


agcagccact 


ggtaacagga 


ttagcagagc 


gaggtatgta 


ggcggtgcta 


6960 


cagagttctt 


gaagtggtgg 


-cctaactacg 


gctacactag 


aaggacagta 


tttggtatct 


7020 


gcgctctgct 


gaagccagtt 


accttcggaa 


aaagagttgg 


tagctcttga 


tccggcaaac 


7080 


aaaccaccgc 


tggtagcggt 


ggtttttttg 


tttgcaagca 


gcagattacg 


cgcagaaaaa 


7140 


aaggatctca 


agaagatcct 


ttgatctttt 


ctacggggtc 


tgacgctcag 


tggaacgaaa 


7200 


actcacgtta 


agggattttg 


gtcatgagat 


tatcaaaaag 


gatcttcacc 


tagatccttt 


7260 


taaattaaaa 


atgaagtttt 


aaatcaatct 


aaagtatata 


tgagtaaact 


tggtctgaca 


7320 


gttaccaatg 


cttaatcagt 


gaggcaccta 


tctcagcgat 


ctgtctattt 


cgttcatcca 


7380 


tagttgcctg 


actccccgtc 


gtgtagataa 


ctacgatacg 


ggagggctta 


ccatctggcc 


7440 


ccagtgctgc 


aatgataccg 


cgagacccac 


gctcaccggc 


tccagattta 


rcagcaataa 


7500 


accagccagc 


cggaagggcc 


gagcgcagaa 


gtggtcctgc 


aactttatcc 


gcctccatcc 


7560 


agtctattaa 


ttgttgccgg 


gaagctagag 


taagtagttc 


gccagttaat 


agtttgcgca 


7620 


acgttgttgg 


cattgctaca 


ggcatcgtgg 


tgtcacgctc 


gtcgtttggt 


atggcttcat 


7680 


tcagctccgg 


ttcccaacga 


tcaaggcgag 


ttacatgatc 


ccccatgttg 


tgcaaaaaag 


7740 


cggttagctc 


cttcggtcct 


ccgatcgttg 


tcagaagtaa 


gttggccgca 


gtgttatcac 


7800 


tcatggttat 


ggcagcactg 


cataattctc 


ttactgtcat 


gccatccgta 


agatgctttt 


7860 


ctgtgactgg 


tgagtactca 


accaagtcat 


tctgagaata 


ccgcgcccgg 


cgaccgagtt 


7920 


gctcttgccc 


ggcgtcaata 


cgggataata 


gtgtatgaca 


tagcagaact 


ttaaaagtgc 


7980 


tcatcattgg 


aaaacgttct 


tcggggcgaa 


aactctcaag 


gatcttaccg 


ctgttgagat 


8040 


ccagttcgat 


gtaacccact 


cgtgcaccca 


actgatcttc 


agcatctttt 


actttcacca 


8100 


gcgtttctgg 


gtgagcaaaa 


acaggaaggc 


aaaatgccgc 


aaaaaaggga 


ataagggcga 


8160 


cacggaaatg 


ttgaatactc 


atactcttcc 


tttttcaata 


ttattgaagc 


atttatcagg 


8220 


gttattgtct 


catgagcgga 


tacatatttg 


aatgtattta 


gaaaaataaa 


caaatagggg 


8280 


ttccgcgcac 


atttccccga 


aaagtgccac 


ctgtatgcgg 


tgtgaaatac 


cgcacagatg 


8340 


cgtaaggaga 


aaataccgca 


tcaggcgaaa 


ttgtaaacgt 


taatattttg 


ttaaaattcg 


8400 
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cgttaaatat ttgttaaatc agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc -84-60 
cttataaatc aaaagaatag accgagatag ggttgagtgt tgttccagfct tggaacaaga "8520 
' gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg 8580 
atggcccact acgtgaacca -tcacccaaat caagt-tt-tt-t -gcggtcgagg tgccgtaaag 8640' 
ctctaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga aagccggcga 8700 
acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg ctggcaagtg 8760 
tagcggtcac gctgcgcgta accaccacac ccgccgcgct taa 8803 

<210> 73 
<211> 8320 
<212> DNA 

<213> Artificial Sequence 

» 

<220> 

' <223> Description of Artificial Sequence: .Recombinant 
pAN296 plasmid 

<400> 73 

tgcgccgcta cagggcgcgt ccattcgoca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagt;tggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt -cgcatgctgg atgaaaagcc 240 
gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 300 
tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggctgaag aagccatcac 360 
gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420 
catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480 
gcatacttat gcgcggctga aggagcccat gtcctgctgt cggctagacg cgaggatcgt 540 
ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg -gacagtgtga gatttt<tcct 600 
ctggatgtcg gccgcctaga ggatat-cgcc cgggtccgcg atcagatcgg ctcgattgat 660 
gtactgatta acaatgcagg cttcggtata tttgaaacgg *tttagactc tacattggat 720 
gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780 
cttccgcaaa tgcttgagca aaaaaaggga <:atatcatca atatcgcctc tcaagcgggg 840 
aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatgccgt gttaggttac 900 
tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960 
ggcccgattc agacggactt ttttt<rcatt gctgataaag gcggggacta cgccaaaaat 1O20 
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gtcggccgct ggatgcttga 



tcctgatgac gtggcagctc 



aaattacagc tgcaattttt 



1080 



acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa .gctgtatcag 1140 



aactgcctgt 


ggtggagtgg 


cttgtttctc acggggcagt 


ttttgatagt ggaagggaga 


1260 


gattgttgaa 


tgtcagttca 


ttcagaagtc cttcatgctc 


tgcttaaaga tccgtttatt 


1320 


cagaaactga 


ttgatgcaga 


gcctgtattc tgggcaaatt 


caggcaagaa agaggggcca 


1380 


ttaccccgtg 


cagatgagtg 


ggcaaccgag atagcggaag 


cggaaaaaag aatgcagcgg 


1440 


tttgcacctt 


acattgccga 


ggtgtttcct gagacgaaag 


gcgctaaagg aatcatcgag 


1500 


tctccgcttt 


ttgaggtgca 


gcatatgaag ggaaagctgg 


aagcggcata tcagcagcca 


1560 


tttcccggaa 


gatggctttt 


aaagtgcgac catgagcttc 


cgatticagg atcgattaaa 


1620 


gcgaggggcg 


ggatttatga 


agtgttaaag tatgctgaaa 


atctcgcgct tcaagaagga 


1680 


atgcttcagg 


aaaccgatga 


ttaccgcatc ttacaggaag 


agcggtttac cgggtttttc 


1740 


tcccgctatt 


cgattgctgt 


cggttcgaca ggaaatctag 


gtttaagcat cggcatcatc 


1800 


ggcgcggcac 


tcgggtttcg 


cgtgacagtg catatgtccg 


ccgatgctaa gcagtggaaa 


1860 


aaggatctcc 


tccgccaaaa 


gggagtcact gttatggagt 


acgaaacaga ttacagtgaa 


1920 


gcggtgaacg 


aagggagacg 


gcaggcggaa caagatccat 


tctgttattt tattgatgat 


1980 


gaacattctc 


gtcagctgtt 


cttaggatat gctgttgctg 


caagccgatt aaaaacacag 


2040 


cttgactgta 


tgaatataaa 


gccaagtctt gagacgccct 


tgtttgtgta tctgccgtgc 


2100 


ggagtcggcg 


gaggaccggg 


cggtgtagca tttgggctga 


agcttttata cggagatgat 


2160 


gttcatgtgt 


ttttcgcaga 


accaactcat . tcaccttgta 


tgctgttagg gctttattca 


2220 


ggacttcacg 


agaagatctc 


cgtccaggat atcggcctgg 


ataatcagac ggctgctgac 


2280 


ggacttgccg 


tagggaggcc 


gtcaggattt gtcggcaagc 


tgattgaacc gcttctgagc 


2340 


ggctgttata 


cggtagagga 


caatacgctt tatactttgc 


ttcatatgct ggctgtatct 


2400 


gaagataaat 


atttagagcc 


ctctgctctt gctggcatgt 


tcgggccggt tcagcttttt 


2460 


tcgacagaag 


agggaaggcg 


ctatgctcag aaatataaga 


tggaacatgc cgtacatgtc 


2520 


gtctggggaa 


cgggaggaag 


catggttcca aaagatgaaa 


tggctgcgta taaccgaatc 


2580 


ggtgctgatt 


tgctaaaaaa 


acgaaatgga aaataagcag 


acagtgaaaa ggttttccgt 


2640 


tacaatcttt 


gtaagggttt 


taacctacag agagtcaggt 


gtaaacagtg aaaaataaag 


2700 


aacttaacct 


acatacttta 


tatacacagc acaatcggga 


gtcttctgca gctcgagcaa 


2760 



ctgtttccag ctcttgtaga 



aaagctggca ggacgcgcgc 



tcatgaaaaa ataatgatag 



1200 
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tagttaccct tattatcaag ataagaaaga aaaggatttt t-cgctacgct caaatccttt 2820 
^aaaaaaacac aaaagaccac attttttaat gtggtctt-ta ttcttcaact aaagcaccca 2880 
ttagttcaac aaacgaaaat tggataaagt gggatatttt taaaatatat atttatgtta 2940 
cagtaatatt gacttttaaa aaaggattga ttctaatgaa gaaagcagac aagtaagcct 3000. 
cctaaattca ctttagataa aaatttagga ggcatatcaa atgaacttta ataaaattga 3060 
tttagacaat izggaagagaa aagagatatt taatcattat ttgaaccaac aaacgacttt 3120 
tagtataacc acagaaattg atattagcgt tttataccga aacataaaac aagaaggata 3180 
, taaattttac cctgcattta ttttcttagt gacaagggtg ataaactcaa atacagcttt 3240 
tagaactggt tacaatagcg acggagagtt aggttattgg gataagttag agccacttta 3300 
tacaattttt gatggtgtat ctaaaacatt ctctggtatt -tggactcctg ta'aagaatga 3360 
cttcaaagag ttttatgatt -tatacctttc tgatgtagag aaatataatg gttcggggaa 34 20 
attgtttccc aaaacaccta tacctgaaaa tgctttttct ctttctatta ttccatggac 3480 
ttcatttact gggtttaaot taaatat<:aa -taataatagt aattactttc tacccattat 3S40 
tacagcagga aaattcatta ataaaggtaa ttcaatatat ttaccgctat -ctttacaggt 3600 
acatcattct gtttgtgatg gttateatgc aggattgttt atgaactcta ttcaggaatt 3660 
gtcagatagg cctaatgact ggcttttata atatgagata atgccgaetg tactttttac 3720 
agtcggtttt ctaatgtcac taacctgccc cgttagfctga agaaggtttt tatattacag 3780 
ctgtcgacta aggtcgagga agtgttggta aggagggtat gaaatgtgca tcatattgaa 3840 
ctgtatgtct ctgatttgga ggcgtctagg cggttttggg ^gct-ggttctt aaaagaactt 3900 
ggttataaag agtatcaaaa atggagctca ggcatcagct ggaagaaaga tcgtttttac 3960 
ctagtgattg tgcaggcgaa agagccattt ctagagccgg aataccatag atgccgagto 4020 
ggtctgaacc atctcgcatt t<:atgctgaa tccaagcttc aagtcgatca gatgactgaa 4O80 
aaattgacgg caaaaggcta tcgtgtgttg taccgagaca ggcatccttt tgccggagga 4140 
gacgggcatt atgcagtctt ttgtgaggat ccagaccgga ttaaggtaga gctcgttgcc 4200 
ccaagctgtt aatcgtgatc ttcggacagg ctgttcagct ttttctcaat gcgatccagc 4260 
tgcgcttttc ggtttttcgc atacttgaag cctgtaacag ccgcaaagac gacagcggca 4320 
aatataataa atacaaacag ctgaaacatc acatcaccta tattcatgtt ctt<cacctca 4380 
tgtttgcggg agagattcat tctcttccgt tttttattta aagcggcttt tccagacggg 4440 
aacggtgttt tgtggtctcc attttcattt gccgataggc gaacgctaaa aatggcaggc 4500 
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cgagcagggt aatgccgctc aggacagaaa aaatataaat cggccggcca gcgccaaaca 4560 
ggtctataca tatccccccg acccaagggc cgatgacgtt tccgagctgt ggaaaaccga 4 620 
ttgccccgaa ataagtgcct tttaatcctg gttttgcaat ctggtctaca tacaaatcca 4,680 
tcatagagaa taaaagcact tcgccgattg taaatgtgat gacaatcatc acaattgatg 4740 
gaacaccgtg tgatacggtg aaaatggcca tgctgatgct aaccatcaca ttaccgagca 4800 
tcagagaaca aagcggcgaa aaccgttttg caaaatggac aatgggaaat tgcgtcgcca 4 860 
acacaacgat tgcgtttaat gtcagcatca gcccatacag cttcgttcca ttgccgatca 4 920 
aggggttctg cgccatatac tgagggaatg tggaactgaa ttgtgagtag ccgaaggtgc 4980 
atagcgtaat gccgaccaaa gcaatggtaa aaagataatc cttttgcgtg accataaacg 5040 
cttcccgcac gctcatattt cgggactggg ctggtgctga taaggatgga tgttttttaa 5100 
attggagggc aagcacaatt ccgtatagtc cgtaaatgac tgcaggcacc aaaaagggcg 5160 
tagtcgattg cgatgagccg aaatataggc caagcacagg tccgaagaca acgccgatat 5220 
taatagccgc atagcgtaaa ttaaaaacta gcagtctcgt tttttcttct gtcatatcag 5280 
acaacaaggc ctttgaagcg ^gctcaaaca gtgatttgca aagaccgttt aatgcgttta 5340 
ctacaaaaaa cacccagaga ttagatgctg ccgcaaagcc tgcaaatacc agcatccatc 5400 
cgaaaatcga tacaagcatc atgttttttc tgccgaattt atctgagata tatccgccgt 54 60 
aaaagcttgc gaggatgccg actgatgagc tcgcggcgat gaccagccct gcataggaag 5520 
ctgatgcgcc ttggacggct gtcaaataaa tcgctaaaaa aggaatgctc atcgatgttg 5580 
ccattctgcc gaaaatggtt ccgattataa ttgtaacgcg ttggatgcat agcttgagta 5640 
ttctatagtg tcacctaaat agcttggcgt aatcatggtc atagctgttt cctgtgtgaa 5700 
attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct 5760 
ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc 5820 
agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 5880 
gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 5940 
ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 6000 
gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 6060 
aggccgcgtt gctggcgttt ttcgataggc tccgcccccc tgacgagcat cacaaaaatc €120 
gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 6180 
ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 6240 
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cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt -6300 
cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 6360 
gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 6420 
cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag 6480, 
agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg 6540 
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 6600 
ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag £660 
, gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact €720 
cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa 6780 
atcaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 6840 
accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag 6900 
ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca 6960. 
gtgctgcaat gataccgcga gacccacgct caccggctcc agatttattca gcaataaacc 7020 * 
agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tcca-tccagt 7080 
ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 7140. 
ttgttggcat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca 7200. 
gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg 7260; 
ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca 7320 
tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg 7380 
tgactggtga gtactcaacc aagtcattct gagaataccg cgcccggcga ccgagttgct 7440 
cttgcccggc gtcaatacgg gataatagtg tatgacatag cagaacttta aaagtgctca 7500 
tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg i:tgagatcca 7560 
gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg 7-620 
tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 7€80 
ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt 7740 
attgtctcat gagcggatac atat*tgaat gtatttagaa aaataaacaa ataggggbtc 7800 
cgcgcacatt tccccgaaaa gtgccacctg tatgcggtgt gaaataccgc acagatgcgt 7860 
aaggagaaaa taccgcatca ggcgaaattg taaacgttaa tattttgtta aaattcgcgt 7920 
taaatatttg ttaaatcagc tcatttttta accaataggc cgaaatcggc aaaatccctt 7980 
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ataaatcaaa agaatagacc gagatagggt tgagtgttgt tccagtttgg aacaagagtc 8040 

i 

cactattaaa gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg 8100 
gcccactacg tgaaccatca cccaaatcaa gttttttgcg gtcgaggtgc cgtaaagctc 8^60 
taaatcggaa ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg 8220 
tggcgagaaa ggaagggaag aaagcgaaag gagcgggcgc tagggcgctg gcaagtgtag 8280 
cggtcacgct gcgcgtaacc accacacccg ccgcgcttaa 8320 

<210> 74 
<211> 250 
<212> PRT 

<213> Clostridium acetobutylicum 

<400> 74 1 
Asn Lys Arg Ala Ala Phe Met Leu Leu Leu Phe Leu Arg Ser Val Leu 
1 5 io 15 

Lys Val He Leu Val Leu Asp Val Gly Asn Thr Asn He Val Leu Gly 
20 25 30 

He Tyr Asn Asp Thr Lys Leu Thr Ala Glu Trp Arg Leu Ser Thr Asp 

35 ■ 40 45 

Val Leu Arg Ser Ala Asp Glu Tyr Gly He Gin Val Met Asn Leu Phe 
50 55 60 

Gin Gin Asp Lys Leu Asp Pro Thr Leu Val Glu Gly Val He He Ser 
65 "70 75 80 

Ser Val Val Pro Asn He Met Tyr Ser Leu Glu His Met He Arg Lys 
85 90 95 

Tyr Phe Lys He Asn Pro Leu Val Val Gly Pro Gly He Lys Thr Gly 
100 105 ~ no 

He Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He 
115 120 125 

Val Asn Ala Val Ala Ala His Glu He Tyr Lys Arg Ser Leu He He 
130 . 135 140 

He Asp Phe Gly Thr Ala Thr Thr Phe Cys Ala Val Arg Glu Asn Gly 
145 150 155 160 

Asp Tyr Leu Gly Gly Ala He Cys Pro Gly He Lys Val Ser Ser Glu 
165 170 " 175 

Ala Leu Phe Glu Lys Ala Ala Lys Leu Pro Arg Val Glu Leu He Lys 
180 185 190 

Pro Ala Tyr Ala He Cys Lys Asn Thr lie Ser Ser He Gin Ser Gly 
195 200 205 
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Ile Val Tyr Arg Tyr Leu Arg Gin Val Lys Tyr Leu £he Glu Lys Leu 
210 215 220 

' Lys Glu Asn Leu Pro Asp Gly Arg Arg Thr Arg Thr Ser Leu Val Leu 
225 230 235 240 

Ala Thr Gly Gly Leu Ala Lys Leu lie Asn • 

245 250 



<210>. 75 
<211> 258 
<212> PRT 

<213> Rhodobacter capsulatus 
<400> 75 

Met Leu Leu Cys He Asp Cys Gly Asn Thr Asn Thr Val Phe Ser Val 

15 10 15 

i 

Trp Asp Gly Thr Asp Phe Ala Ala Thr Trp Arg He /Via Thr Asp His 
20 25 30 

Arg Arg Thr Ala Asp Glu Tyr Phe Val Trp Leu Asn Thr Leu Met Gin 
35 40 45 

Leu Lys Gly Leu Gin Gly Arg He -Ser Glu Ala He He Ser Ser Thr 
50 55 60 

Ala Pro Arg Val Val Phe Asn Leu Arg Val Leu Cys Asn Arg Tyr Phe 
65 70 75 <8C 

Asp Cys Arg Pro Tyr Val Val Gly Lys Pro Gly Cys -Glu Leu Pro Val 
85 90 95 

Ala * Pro Arg Val Asp Pro Gly Thr Thr Val Gly Pro Asp Arg Leu Val 
100 105 110 

Asn Thr Val Ala Gly Tyr Asp Arg His Gly Gly Asp Leu He Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Asp Val Val Ala Pro Asp -Gly Ala 
130 135 140 

Tyr He Gly Gly Val He Ala Pro Gly Val Asn Leu Ser Leu *Glu Ala 
145 150 155 ' 1-60 

Leu His Met Ala Ala Ala Ala Leu Pro His Val Asp Val Thr Lys Pro 
165 170 175 

Gin Gly Val He Gly Thr Asn Thr Val Ala Cys He -Gin Ser Gly Val 
180 185 190 

Tyr Trp Gly Tyr lie Gly Leu Val Glu Gly He Val Arg Gin lie Arg 
195 200 205 

Met Glu Arg Asp Arg Pro Met Lys Val He Ala Thr <Gly Gly Leu Ala 
210 215 220 
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Ser Leu Phe Asp Leu Gly Phe Asp 
225 230 

Leu Thr Met His Gly Leu Arg Leu 
245 

Gly Ala 



-89- 

Leu Phe Asp Lys Val Glu Asp Asp 
235 240 

lie Phe Asp Tyr Asn Lys Gly Leu 
250 255 



<210> 76 
<211> 10801 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN240 plasmid 



<400> 76 
gaattttgcg 


gccgcttcga 


aagctgtaat 


ataaaaacct 


tcttcaacta 


acggggcagg 


60 


ttagtgacat 


tagaaaaccg 


actgtaaaaa 


gtacagtcgg 


cattatctca 


tattataaaa 


120 


gccagtcatt 


aggcctatct 


gacaattcct 


gaatagagtt 


cataaacaat 


cctgcatgat 


180 


aaccatcaca 


aacagaatga 


tgtacctgta 


aagatagcgg 


taaatatatt 


gaattacctt 


240 ' 


tattaatgaa 


ttttcctgct 


gtaataatgg 


gtagaaggta 


attactatta 


ttattgatat 


300 


ttaagttaaa 


cccagtaaat 


gaagtccatg 


gaataataga 


aagagaaaaa 


gcattttcag 


360 


gtataggtgt 


tttgggaaac 


aatttccccg 


aaccattata 


tttctctaca 


tcagaaaggt 


420 


ataaatcata 


aaactctttg 


aagtcattct 


ttacaggagt 


ccaaatacca 


gagaatgttt 


480 


tagatacacc 


atcaaaaatt 


gtataaagtg 


gctctaactt 


atcccaataa 


cctaactctc 


540 


cgtcgctatt 


gtaaccagtt 


ctaaaagctg 


tatttgagtt 


tatcaccctt 


gtcactaaga 


600 


aaataaatgc 


agggtaaaat 


ttatatcctt 


cttgttttat 


gtttcggtat 


aaaacactaa 


660 


tatcaatttc 


tgtggttata 


ctaaaagtcg 


tttgttggtt 


caaataatga 


ttaaatatct 


720 


cttttctctt 


ccaattgtct 


aaatcaattt 


tattaaagtt 


catttgatat 


gcctcctaaa 


780 


tttttatcta 


aagtgaattt 


aggaggctta 


cttgtctgct 


ttcttcatta 


gaatcaatcc 


840 


ttttttaaaa 


gtcaatatta 


ctgtaacata 


aatatatatt 


ttaaaaatat 


cccactttat 


900 


ccaattttcg 


tttgttgaac 


taatgggtgc 


tttagttgaa 


gaataaagac 


cacattaaaa 


960 


aatgtggtct 


tttgtgtttt 


tttaaaggat 


ttgagcgtag 


cgaaaaatcc 


ttttctttct 


1020 


tatcttgata 


ataagggtaa 


ctattgaatt 


cggtaccaag 


agtttgtaga 


aacgcaaaaa 


1080 


ggccatccgt 


caggatggcc 


ttctgcttaa 


tttgatgcct 


ggcagtttat 


ggcgggcgtc 


1140 
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ctgcccgcca ccct-ccgggc cgttgcttcg caacgttcaa atccgctecc ggcggatttg 1200 
tcctactcag gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctt^-cg 1240 
actgagcctt tcgttttatt tgatgcctgg -cagttcccta cfcctcgcatg gggagacccc 1320 
acactaccat cggcgctacg gcgtttcact tct^agttcg gcatggggtc aggtgggacc 1380 
accgcgctac tgccgccagg caaattctgt tttatoagac cgcttctgcg ttctgat<tta 1440 
atctgtatca ggctgaaaat ctfcctctcat ccgccaaaac aggatcctac ggaaatggag 1500 
cggcaaaacc gttttactct caaaatctta aaagaaaacc cccgataaag ggggcttttc 15-60 
ttctacaaaa ttgtacgggc tggttcgttc -cccagcattt gttcaatttt gttttgatca 1620 
ttcagaacag ccactttcgg ctcatggctt gccgcttctt gatcagacat cattttgtag 1680 
gaaataataa tgaccttatc tccttcctgc acaaggcgtg -cggctgcacc gtttaagcat 1~M0 
atgacgccgc ttccccgttt accaggaata atatacgttt caagacgtgc tccattatta 1800 
ttattcacaa tttgtacttt ttcattagga agcattccca -cagcatcaat gagatcttca 18S0 
tcaattgtaa tgcttcccac atagttcagg tttgcttccg taacagttgc octgtgaagt 1920 
ttgccgctca tcattgttcg atacatatta tattct-ctcc atttctcgaa tatcaataat 1980 
gatattatct attaaacgcg cttttgaaaa agcaactgca acagcgagaa tcatctttcc 2040 
agcaatttca ttcacaggct cgagttccgg ataggaataa agctctacat agtctatggt 2100 
tccgctagtc gtttcaatga tatcttttgc agcttttatc accgcttoag gatcfectfctc 2160 
accggcttgg acaagttccg cacttgtttg aagggcccga tacagcttag gcgcttcttt 2220 
tctttcctca gctgttaagt atacattgcg agagcttttg gctaagccgt cttcctotot 2280 
gaccgtatcg acaggaacca attcaatatc catgaagaag t«cgctgatta acccatx^iac -2340 
aacagctacc tgctgcgcat cttttaaacc gaaataggca ogagtcggct tgactagatt 2400 
gaaaagcttc gtcagtacga tcgcgacccc gtcaaaatgt cct<tctcttg agcgcccgca 2460 
taacacgtct gtgcgtcttt ctacatgaat cgtgacattc tttfccaccgg gatacatatc 2520 
atgagcatct ggcgtaaaaa gaatatcgac tccggcgttt tctgcaagag ctgcatcccg 2580 
ctcaatatcg cgcggatatg cttcaaaatc ttcattaggg ccgaattgtg caggatt-cac 2640 
aaaaatactc ataataacgg cgtcgttttc ttgtcttgct tt^tctgcta aggttaaatg 2700 
cccctcatgc agaaacccca tcgtcggaac aaatccgatt gact-tgccct ctgaatggta 27*60 
ttgttttatg gcttctttca ^gctgtgaaat atcagtaatc tgtctcatct tattttcccc 2820 
cgtacaagcc gfccaagcact gtctggttca tttgaaagga atgcttttgt tcagggaaag 2880 
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cacgatgtct tacatcctga acatatccgc 
gcgtatattg ctttacaaat ttaggtgttc 
aaacgagaac ttgtccgtcc gctttcacac 
gcgtctcggc aattttggct gtgagttctg 
ctcctgcttc ttcgcatttt atactgtctt 
tgccctgtac tttatagccg cccagtacgc 
tgactactgg aatgcctcca agcgtcaatg 
cctcaagctt cagtgcgtca gctccgcttt 
tatcttcctt agacaggtga taagacataa 
gcgcacccct tttaacggct tttgtatgat 
ttgaatcaag gccgaggacg accattccaa 
ccgcttgttc agcaagttta gctgccggat 
cttcagactc cttcattttt agaaaatcca 
gagcgtcctg ctgttgttaa gattattata 
ttgcaaaatt tttcaggaat tttagcagag 
ccgctcttgt gctgttagga tatctttctt 
ttggttaaaa ggaaacaaaa agaccgtttt 
tacagtcaca gccacttttg caaaaaccgg 
gtcgacgatg atctgccgtt ttcttctgca 
gattcttcac tttctaaagt tcggcgagtt 
caaggcagac tgctgcaatg tctatctatt 
tgcccaatgg aagtatacca aaatcaacgg 
atcggcagaa tagatttttt taatgccttc 
atctatacct aacgccttac cgtaaaaggt 
gccaataccg agcgcatagc tttcccataa 
ataatcccgg taccgtttct caaagcatag 
aattttttcc ccagcagctt ggctgaggct 
atcattaggc tgctggttaa cgttaatgcc 
ttcttcagcc tgcatttccg ttaggatacc 



-91 - 



tgattgctgt 


ttcgatggtt 


tcatcaatgc 


2940 


tctcaacacc 


gtggccgata 


atatcatgat 


3000 


cagccccgat 


tccaatgacc 


ggtatgctta 


3060 


ccggcacaca 
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3720 
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3780 


ggaagctagg 
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3840 


cacacaaaac 
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gatttctttt 
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acagcttcat 


gccttataac 


tgctgtttcg 


3960 


agccaaaaaa 


ccttccgtta 


caacgagaag 


4020 


tcatccctct 
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ttttttggat 


4080 


ttaataatag 


gtgcagttcg 


caggcgatac 


4140 
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4200 


gttcgtttct 


aaaagcagaa 


cgccttcatc 


4260 
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4440 


tgtcgcgatg 
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4560 
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4620 
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atttggccat ttaatatccg tttggatgcc tgctgcctct tctattccct gcacaacagc 4680 
tactgcagca agcagagtca gctgcggtgt tttttggagc ggaatgbcag gccgcaaaat 4740 
caggctcatc caaacaccgb ttccttcttg agaatgccat accctagaca ttcggcccct 4800 
tccggctgtt tgtttgtcag ccaccacaag ggtgccttcc ggtgcgttat tattcgpgag 4860 
ctcatgagcc gttttttgcg tgcttgaaag aacgtcatgg taaataagat gctggcccat 4920 
cacttccgtt tttaatccaa aacgaatttc gctttcactg agttttccgg gbtttttgat 4980 
gagccgatat ccttttcttc taacggcttc tacttcataa cccfcctttcc gaagctcttc 5040 
aatatgcttc cacacagcag ttcttgaaca gccgagagca tcactgattt tttggccgga 5100 
aataaattca ttgccggcct gagaaaataa ttcaataagg tcttttctta atgttgaccg 5160 
catgtcttca gccactcctc tatgtgtttc ttttgattgg agagattccc tgtcacaaca 5220 
' gcctgctcga tccactgtaa ttcttctgac acecatttt-c cggccggccg gtttcgaagc 5280 
gcaagcaagt ccttacccgt gatatcaaga tccttaaggc ttttgatcgg -caggttttga 5340 
taagcgtact gaatgtcctt cagtttcttt tcatccagtt tttcgttttg ccgaagctgc 5400. 
gatattttgg ccgctgagag cagtgctttt ttcccagctc tgtacattgt cattgcgbca 5460 
aggctctggc caaacgtatc ggcaatgtga atggcttcct tgatcacttt tcccgggagc 5520 
ttccaggctt tcaggaaaag gggcgcgtct ttcaaaacta fcgccaaggtt aattaaaaga 5580 
gcagcccaaa gctcctcacg ggatgttaaa gagaagaatg gaaactcact cgttgaaat-c 5640 
aggttttctc gtttatgata aaaaccagga agctcttcat acaatctcgt ttgaatgagt 57O0 
gtttgaagcg cctggcgaga agctcttccc tgcagcaatt fectcaaactc tatagttttt 5760 
cgttcgactg aaacatggga gaggagtgat ttttctttcg -caatggcttc ttctgtttcc 5820 
ggtgaaagcg taaagccaag ctggctcata aagcgtacgg ctctcagcat acgaagcgca 5880 
tcctcttgaa atctatcctc aggctttcca acggttcgaa tcactttctg afccaatatct 5940 
ttcttgccgc caaaataatc aagcaccttc ccgtccgctg tcatggccat cgcattgatc 6000 
gttaaatctc tgcgttttag atcctcttct aatgatgaga taaattgcac ttctgacggt 6060 
cttctgaaat caacataatc agattcagtc cggaatgtcg tgacttcata ggtttcatcc 6120 
tcccagagca caataatggt cccgtgctct ttgcctacat caacagtccg ctgaaacagc -6180 
cgttctactt gatcaggtgc cgcatctgtc gcgatatcga catctccgat -cgttcgtttc 6240 
atatagctgt cacgaactgc gcccccgaca aaataagcct gatggcccgc ttcgattaag "6300 
atgcggagca cgggaagtgc tttgataaaa actttttcca tgtgatcact ccggttctgc -6360 
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atgttgaata ctcatactct toctttttca 
tctcatgagc ggatacatat ttgaatgtat 

cacatttccc cgaaaagtgc <:acctgacgt 
ctataaaaat aggcgtatca cgaggccctt 
caactgcggg tcaaggatct ggatttcgat 
ggctccaagg atcgggcctt gatgttaccc 
gggaattgat ccggtggatg accttttgaa 
ttggggaccc tagaggtccc <cttttttatt 
gcataacggg ttttgctgcc -cgcaaacggg 
atcgcagatc cggcttcagg tttgccggct 

' attttttccc cacgggaggc gtcactggct 
gcagcatcgc ctgtttcagg ctgtctatgt 

' ggtgttcaat ttcatgttct agttgctttg 
ttacatgctg ttcatctgtt acattgtcga 
caaaaactcg taaaagctct gatgtatcta 
tggacagttt tccctttgat atctaacggt 
ttgatgcttc actgatagat acaagagcca 
agtatgttct ctagtgtggt tcgttgtttt 
tcatgcttac tttgcatgtc actcaaaaat 
gcagttaaag catcgtgtag tgtttttctt 
atggttgttg gtattttgtc accattcatt 
agatccattt gtctatctag ttcaacttgg 
ttatcaacca ccaatttcat attgctgtaa 
acccattggt taagcctttt aaactcatgg 
aattcatcaa ggctaatctc tatatttgcc 
aataaccact <:ataaatcct catagagtat 
ttatatttta tgaatttttt taactggaaa 
aattctaatt tttcgcttga gaacttggca 
ttaaccaaag gattcctgat ttccacagtt 
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atattattga agcatttatc agggttattg 8160 
ttagaaaaat aaacaaatag gggttccgcg 8220 
ctaagaaacc attattatca tgacattaac 8280 
tegtctcgca -tgcggatcag fcgagggtttg 8340 
cacggcacga tcatogtgcg ggagggcaag 8400 
gagagcttgg cacccagcct gcgcgagcag 8460 
tgacctttaa tagattatat -tactaattaa 8520 
ttaaaaattt <tttcacaaaa <:ggtttacaa 8580 
ctgttctggt gttgctagtt tgttatcaga 8640 
gaaagcgcta ttt<:t£ceag aattgccatg 8700 
cccgtgttgt cggcagcttt gattcgataa 8760 
gtgactgttg agctgtaaca agttgtctca 8820 
ttttactggt ttcacctgtt ctattaggtg 8880. , 
tctgttcatg gtgaacagct ttaaatgcac 8 940 , 
tcttttttac accgtttbca tctgtgcata 9000. 
gaacagttgt tctacttttg tttgttagtc 9060 . 
taagaacctc agatccttcc gtatttagcc 9120 
tgcgtgagcc atgagaacga accattgaga 9180 
tttgcctcaa aactggtgag -ctgaattttt 9240 
agtccgttac gtaggtagga atctgatgta 9300 
tttatctggt tgttctcaag ttcggttacg 9360 
aaaatcaacg tatcagtcgg gcggcctcgc 9420 
gtgtttaaat ctttacttat tggtttcaaa 9480 
tagttatttt caagcattaa <:atgaactta 9540 
ttgtgagttt tcttttgtgt tagttctttt 9600 
ttgttttcaa aagacttaac atgttccaga 9660 
agataaggca atatctcttc actaaaaact 9720 
tagtttgtcc actggaaaat ctcaaagcct 9780 
ctogtcatca gctctctggt tgctttagct 9840 
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aatacaccat aagcattttc cctactgatg ttcatcatct gagcgtattg gttataagtg 9900 

aacgataccg tccgttcttt ccttgtaggg ttttcaatcg tggggttgag tagtgccaca 9960 

cagcataaaa ttagcttggt ttcatgctcc gttaagtcat agcgactaat cgctagttca 10020 

tttgctttga aaacaactaa ttcagacata catctcaatt ggtctaggtg attttaatca 10080 

ctataccaat tgagatgggc tagtcaatga taattactag tccttttcct ttgagttgtg 10140 

ggtatctgta aattctgcta gacctttgct ggaaaacttg taaattctgc tagaccctct 10200 

gtaaattccg ctagaccttt gtgtgttttt tttgtttata ttcaagtggt tataatttat 10260 

agaataaaga aagaataaaa aaagataaaa agaatagatc ccagccctgt gtataactca 10320 

ctactttagt cagttccgca gtattacaaa aggatgtcgc aaacgctgtt tgctcctcta 10380 

t 

caaaacagac cttaaaaccc taaaggctta agtagcaccc tcgcaagctc gggcaaatcg 10440 

ctgaatattc cttttgtctc cgaccatcag gcacctgagt cgctgtcttt ttcgtgacat 10500 

tcagttcgct gcgctcacgg ctctggcagt gaatgggggt aaatggcact acaggcgcct 10560 

tttatggatt catgcaagga aactacccat aatacaagaa aagcccgtca cgggcttctc 10620 

agggcgtttt atggcgggtc tgctatgtgg tgctatctga cttttcgctg ttcagcagtt 10680 

. cctgccctct gattttccag tctgaccact tcggattatc ccgtgacagg tcattcagac 10740 

•tggctaatgc acccagtaag gcagcggtat catcaacagg cttacccgtc ttactgtcaa 10800 

c 10801 

<210> 77 
<211> 8654 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN236 plasmid 

<400> 77 

ctcgaggcct acctagcttc caagaaagat atcctaacag cacaagagcg gaaagatgtt 60 

ttgttctaca tccagaacaa cctctgctaa aattcctgaa aaattttgca aaaagttgtt 120 

gactttatct acaaggtgtg gtataataat cttaacaaca gcaggacgct ctagaggagg 180 

agacatcatg aaaattggaa ttatcggcgg aggctccgtt ggtcttttat gcgcctatta 240 

tttgtcactt tatcacgacg tgactgttgt gacgaggcgg caagaacagg ctgcggccat 300 

tcagtctgaa ggaatccggc tttataaagg cggggaggaa ttcagggctg attgcagtgc 360 
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ggacacgagt atcaattcgg actttgacct gcttgtcgtg acagtgaagc agcatcagct 420 
tcaatctgtt ttttcgtcgc ttgaacgaat cgggaagacg aatatattat ttttgcaaaa 480 
cggcatgggg catatccacg acctaaaaga ctggcacgtt ggccattcca tttatgttgg 540 
aatcgttgag cacggagctg taagaaaat-c ggatacagct gttgatcata caggcctagg 600 
tgcgataaaa tggagcgcgt tcgacgatgc tgaaccagac cggctgaaca tcttgtttca 660 
gcataaccat tcggattttc ogatttatta tgagacggat tggtaccgtc tgctgacggg 720 
caagctgatt gtaaatgcgt gtattaatcc tttaactgcg ttattgcaag tgaaaaatgg 780 
agaactgctg acaacgccag cttatctggc ttttatgaag ctggtatttc aggaggcatg 840 
ccgcatttta aaacttgaaa atgaagaaaa ggcttgggag cgggttcagg ccgtttgtgg 900 
gcaaacgaaa gagaatcgtt catcaatgct ggttgacgtc attggaggcc ggcagacgga 960 
agctgacgcc attatcggat acttattgaa ggaagcaagt cttcaaggtc ttgatgccgt 1020 
ccacctagag tttttatatg gcagcatcaa agcattggag cgaaatacaa acaaagtctt 1080 
ttgagctttt tcggtaacat gctatactca tttcggatca ctaactattt attggaggat 1140 
cctgttttgg cggatgagag aagattttca gcctgataca gattaaafcca gaacgcagaa 1200 
gcggtcrgat aaaacagaat ttgcctggcg gcagtagcgc ggtggtccca cctgacccca 1260 
tgccgaactc agaagtgaaa cgccgtagcg ccgatggtag tgtggggtct ccccatgcga 1320 
gagtagggaa ctgccaggca tcaaataaaa cgaaaggctc agtcgaaaga ctgggccttt 1380 
cgttttatct gttgtttgtc ggtgaacgct ctcctgagta ggacaaatoc gccgggagcg 1440 
gatttgaacg ttgcgaagca acggcccgga gggtggcggg caggacgccc gccataaact 1500 
gccaggcatc aaattaagca gaaggccatc ctgacggatg gcctttttgc gtttctacaa 1560 
actcttggta cccagaaaaa gcggcaaaag cggctgttaa aaaagcgaaa tcgaagaagc 1620 
tgtctgccgc taagacggaa tatcaaaagc gttctgctgt tgtgtcatct ttaaaagtca 1*680 
cagccgatga atcccagcaa gatgtcctaa aatacttgaa cacccagaaa gataaaggaa 1740 
atgcagacca aatt-cattct tattatgtgg tgaacgggat tgctgtfccat gcctcaaaag 1800 
aggttatgga aaaagtggtg cagtttcccg aagtggaaaa ggtgcttcct aatgagaaac 1860 
ggcagctttt taagtcatcc tccccattta atatgaaaaa agcacagaaa gctattaaag 1920 
caactgacgg tgtggaatgg aatgtagacc aaatcgatgc occaaaagct tgggcacttg 1980 
gatatgatgg aactggcacg gttgttgcgt ccattgatac cggggtggaa tggaatoatc 2040 
cggcattaaa agagaaatat cgcggatata atccggaaaa t<x:taatgag cctgaaaatg 2100 
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cctgcgagta 


caaactgggt 


gaacacagcc 


3840 



% 
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tttatgttaa ccttttccat t,ggaacagct 
atcaaaaggt tactcctatt tggaattata 
gttggccatt ctttcttttc cttacttatt 
gctgcatttc cagcactcgt aatggttgta 
ggtaaagcat ttggtcttat tggatcgata 
attggtggaa tgatagccca ttatattcat 
acaattatca ctgttccgtt tcttatgaaa 
cattttgata tcaaaggaat tatactaatg 
acaacatcat atagcatttc ttttcttatc 
aaacatatca ggaaagtaac agatcctttt 
tttatgattg gagttctttg tgggggaatt 
atggttcctt atatgatgaa agatgttcac 
attattttcc ctggaacaat gagtgtcatt 
gatagaagag gtcctttata cgtgttaaac 
ttaactgctt cctttctttt agaaacaaca 
. gttttaggtg ggctttcgtt caccaaaaca 
aaacagcagg aagctggtgc tggaatgagt 
ggaacaggta ttgcaattgt aggtggttta 
ttacctatgg aagttgatca gtcaacttat 
ggaatcattg tcattagttg gctggttacc 
ttctaaatcg ttaagggatc aactttggga 
cagttcgaag cggccgcaat tcttgaagac 
aggttaatgt catgataata atggtttctt 
tgcgcggaac ccctatttgt ttatttttct 
gacaataacc ctgataaatg cttcaataat 
atttccgtgt cgcccttatt cccttttttg 
cagaaacgct ggtgaaagta aaagatgctg 
tcgaactgga tctcaacagc ggtaagatcc 
caatgatgag cacttttaaa gttctgctat 
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gtatatggaa agctatctga tcaattaggc 3900 
ataaattgtt tcgggtcggt aattgggttt 39*60 
atggctcgtt ttattcaagg ggctggtgca 4020 
gttgcgcgct atattccaaa ggaaaatagg 4080 
gtagccatgg gagaaggagt cggtocagcg 4140 
tggtcctatc ttctactcat tcctat-gata 4200 
ttattaaaga aagaagtaag gataaaaggt 4260 
tctgtaggca ttgtattttt tatgttgttt 4 320 
gttagcgtgc tgtcattcct gatatttgta 4 380 
gttgatcccg gattagggaa aaatatacct 4 44 0 
atatttggaa cagtagcagg gtttgtctct 4 500 
cagctaagta ctgccgaaat cggaagtgta 4560 
attttcggct acattggtgg gatacttgtt 4620 • 
.atcggagtta catttctttc tgttagcttt 4t580 ■ 
t-catggttca tgacaattat aatcgtattt 4740.. 
gttatatcaa caatt^tttc aagtagcttg 4800 
.ttgcttaact ttaccagctt tttatcagag 4 8*60 . 
ttatccatac ccttacttga tcaaaggttg 4 920 
ctgtatagta atttgttatt acttttttca 4 980 
ttgaatgtat ataaacattc tcaaagggat 5040 
gagagttcaa aattgatcct ttttttataa 5100 
gaaagggcct -cgtgatacgc ctatttttat 5160 
agacgtcagg tggcactttt cggggaaatg 5220 
aaatacattc aaatatgtat ccgcfccatga 5280 
attgaaaaag gaagagtatg agtattcaac 5340 
cggcattttg ccttcctgtt tttgctcacc 5400 
aagatcagtt gggtgcacga gtgggttaca 5460 
ttgagagttt tcgccccgaa gaacgttttc 5520 
gtggcgcggt attatcccgt attgacgccg 5580 
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ggcaagagca actcggtcgc cgcatacact 
cagtcacaga aaagcatctt acggatggca 
taaccatgag tgataacact gcggccaact 
agctaaccgc ttttttgcac aacatggggg 
cggagctgaa tgaagccata ccaaacgacg 
caacaacgtt gcgcaaacta ttaactggcg 
taatagactg gatggaggcg gataaagttg 
ctggctggtt tattgctgat aaatctggag 
cagcactggg gccagatggt aagccctccc 
aggcaactat ggatgaacga aatagacaga 
attggtaact gtcagaccaa gtttactcat 
tttaatttaa aaggatctag gtgaagatcc 
aacgtgagtt ttcgttccac tgagcgtcag 
gagatccttt ttttctgcgc gtaatctgct 
cggtggtttg tttgccggat caagagctac 
gcagagcgca gataccaaat actgtccttc 
agaactctgt agcaccgcct acatacctcq 
ccagtggcga taagtcgtgt cttaccgggt 
cgcagcggtc gggctgaacg gggggttcgt 
acaccgaact gagataccta cagcgtgagc 
gaaaggcgga caggtatccg gtaagcggca 
ttccaggggg aaacgcctgg tatctttata 
agcgtcgatt tttgtgatgc tcgtcagggg 
cggccttttt acggttcctg gccttttgct 
tatcccctga ttctgtggat aaccgtatta 
gcagccgaac gaccgagcgc agcgagtcag 
ggtattttct ccttacgcat ctgtgcggta 
caatctgctc tgatgccgca tagttaagcc 
ggtcatggct gcgccccgac acccgccaac 
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attctcagaa 


tgacttggtt 


gagtactcac 


5640 


tgacagtaag 


agaattatgc 


agtgctgcca 


5700 


tacttctgac 


aacgatcgga 


ggaccgaagg 


5760 


atcatgtaac 


tcgccttgat 


cgttgggaac 


5820 


agcgtgacac 


cacgatgcct 


'gcagcaatgg 


5880 


aactacttac 


tctagcttcc 


cggcaacaat 


5940 


caggaccact 


tctgcgctcg 


gcccttccgg 


6000 


ccggtgagcg 


tgggtctcgc 


ggtatcattg 


6060 


gtatcgtagt 


tatctacacg 


acggggagtc 


6120 


tcgctgagat 


aggtgcctca 


ctgattaagc 


6180 


atatacttta 


gattgattta 


aaacttcatt 


6240 


tttttgataa 


tctcatgacc 


aaaatccctt 


6300 


accccgtaga 


aaagatcaaa 


ggatcttctt 


6360 


gcttgcaaac 


aaaaaaacca 


ccgctaccag 


6420 


caactctttt 


tccgaaggta 


actggcttca 


6480 


tagtgtagcc 


gtagttaggc 


caccacttca 


6540 


ctctgctaat 


cctgttacca 


gtggctgctg 


6600 


tggactcaag 


acgatagtta 


ccggataagg 


6660 


gcacacagcc 


cagcttggag 


cgaacgacct 


6720 


tatgagaaag 


cgccacgctt 


cccgaaggga 


6780 


gggtcggaac 


aggagagcgc 


acgagggagc 


6840 


gtcctgtcgg 


gtttcgccac 


ctctgacttg 


6900 


ggcggagcct 


atggaaaaac 


gccagcaacg 


6960 


ggccttttgc 


tcacatgttc 


tttcctgcgt 


7020 


ccgcctttga 


gtgagctgat 


accgctcgcc 


7080 


tgagcgagga 


agcggaagag 


cgcctgatgc 


7140 


tttcacaccg 


catatggtgc 


actctcagta 


7200 


agtatacact 


ccgctatcgc 


tacgtgactg 


7260 


acccgctgac 


gcgccctgac 


gggcttgtct 


7320 
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gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 7380 
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 7440 
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 7500 
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 7560 
ggtcacttga tgcctccgtg taagggggaa *ttctgttca tgggggtaat gataccgatg 7«620 
aaacgagaga ggatgctcac gatacgggtt actgatgatg aacatgcccg gttactggaa 7-680 
cgttgtgagg gtaaacaact ggcggtatgg atgcggcggg accagagaaa aatcactcag 7740 
ggtcaatgcc agcgcttcgt taatacagat gtaggtgttc cacagggtag ccagcagcat 7B00 
cctgcgatgc agatccggaa cataatggtg cagggcgctg acttccgcgt ttccagactt 7860 
tacgaaacac ggaaaccgaa gaccattcat gttgttgctc aggtcgcaga cgttttgcag 7920 
cagcagtcgc ttcacgttcg ctcgcgtatc ggtgattcat tctgctaacc agtaaggcaa 7980 
ccccgccagc ctagccgggt cctcaacgac aggagcacga tcatgcgcac ccgtggccag 8040 
gacccaacgc tgcccgagat gcgccgcgtg cggctgctgg agatggcgga cgcgatggat -8100 
atgttctgcc aagggttggt ttgcgcattc acagt.tctcc gcaagaattg attggctcca 8160 
attcttggag tggtgaatcc gttagcgagg tgccgccggc ttccattcag gtcgaggtgg 8220 
cccggctcca tgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg gcggcgccta 3280 
caatccatgc caacccgttc catgtgctcg ccgaggcggc ataaatcgcc gtgacgatca 8340 
gcggtccagt gatcgaagtt aggctggtaa gagccgcgag cgatccttga agctgtccct 8400 
gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc 84*60 
cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcggc ggccgcttcg 6520 
tcgaccgaaa cagcagttat aaggcatgaa gctgtccggt ttttgcaaaa gtggctgtga S580 
ctgtaaaaag aaatcgaaaa agaccgtttt gtgtgaaaac ggtctttttg tttcctttta 8640 
accaactgcc ataa -8654 

<210> 78 
<211> 8093 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN423 plasmid 



<400> 78 

ggcggccgct tcgtcgaccg aaacagcagt tataaggcat ^aagctgtcc ggtttttgca 60 
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aaagtggctg tgactgtaaa aagaaatcga 
ttgtttcctt ttaaccaact gccataactc 
ctaacagcac aagagcggaa agatgttttg 
tcctgaaaaa ttttgcaaaa agttgttgac 
aacaacagca ggacgctcta gaaaaggagg 
caaacttcac agggcaactg ttacggaagc 
tgatgaagat ctcattgatg ctgtgggaat 
taataataat ggagcacgtc ttgaaacgta 
catatgctta aacggtgcag ccgcacgcct 
ttcctacaaa atgatgtctg atcaagaagc 
gaatgatcaa aacaaaattg aacaaatgct 
aaggatcctg ttttggcgga tgagagaaga 
gcagaagcgg tctgataaaa cagaatttgc 
accccatqcc gaactcagaa gtgaaacgcc 
atgcgagagt agggaactgc caggcatcaa 
gcctttcgtt ttatctgttg tttgtcggtg 
ggagcggatt tgaacgttgc gaagcaacgg 
taaactgcca ggcatcaaat taagcagaag 
ctacaaactc ttggtaccca gaaaaagcgg 
agaagctgtc tgccgctaag acggaatatc 
aagtcacagc cgatgaatcc cagcaagatg 
aaggaaatgc agaccaaatt cattcttatt 
caaaagaggt tatggaaaaa gtggtgcagt 
agaaacggca gctttttaag tcatcctccc 
ttaaagcaac tgacggtgtg gaatggaatg 
cacttggata tgatggaact ggcacggttg 
atcatccggc attaaaagag aaatatcgcg 
aaaatgaaat gaactggtat gatgccgtag 
ctcatggaac ccacgtgaca ggcacgatgg 



aaaagaccgt 


tfctcrtcrtcraa 

u i. l^j vy ua 


da^y y tU-LLU 




aaocrcct acc 

*J ^ \A U SJi N**« 


"t~arrr , "h*rf , r" , aa 


y acidy a, T- a u c 


i on 


ttcfcaeatcc 


a rraa cz* a r*c^ 
ayaQL.flaLO i- 


/"» 4~ rro fraa a al- 
LLgCLdddal. 


OA fi 


ttt-at-rt-ana 


^gg^yt-ggta 


faaf aaf o 4*4- 

uaataauctit: 






LOLuy Lauya 


uga tgay egg 






uaty ugggaa 


gcattacaat 


420 


gctLCCuadl 


gaaaaagtac 


aaattgtgaa 


480 


LdLLdLULLl 


ggtaaacggg 


gaagcggcgt 


540 


ugty caggaa 


ggagataagg 


tcattattat 


600 


gycaagccau 


gagccgaaag 


tggctgttct 


660 


yyyy aacgaa 


ccagcccgta 


caattttgta 


720 


tutu CdyCCL 


gatacagatt 


aaatcagaac 


780 


*- u yy cgycag 


uagcgcggug 


gtcccacctg 


840 


gtagcgccga 


tggtagtgtg 


gggtctcccc 


900 


ataaaacgaa 


aggctcagtc 


gaaagactgg 


960 


aaCyCUCLCC 


tgagtaggac 


aaatccgccg 


1020 


uccygagggt 


ggcgggcagg 


acgcccgcca 


1080 


ytt-dtLL uuja 


cgganggcct: 


ttttgegttt- 


1140 


/"•ana n rrin 


4" /t4- 4~ a sa = ?a -5 
LyLLdddddd 


gegaaatega 


1200 


aaaay U-y l_ v. w 


t.gci_gt.ty ug 


tcatctttaa 


1260 


fccct aaaaf a 

w *>*»w CI Vh* u U V* CI 




cagaaagata 


1320 


atotcrataaa 




gttcatgect 


1380 


t tccca aa.tr t - 


yyaaaayy i_y 


cttcctaatg 1440 


catttaatat 


gaaaaaagca 


cagaaagcta 


1500 


tagaccaaat 


cgatgcccca 


aaagcttggg 


1560 


ttgcgtccat 


tgataccggg 


gtggaatgga 


1620 


gatataatcc 


ggaaaatcct 


aatgagcctg 


1680 


caggcgaggc 


aagcccttat 


gatgatttgg 1740 


tgggctctga 


acctgatgga 


acaaatcaaa 


1800 
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tcggtgtagc acctggcgca aaatggattg <:tgttaaagc gttctctgaa gatggcggca I860 
ctgatgctga cattttggaa gctggtgaat gggttttagc accaaaggac gcggaaggaa 1*20 
atccccaccc ggaaatggct cctgatgttg tcaataactc atggggaggg ggctctggac 1980 
ttgatgaatg gtacagagac atggtcaatg cctggcgttc ggccgatatt ttccctgagt 2040 
tttcagcggg gaatacggat ctctttattc ccggcgggcc tggttctatc gcaaatccgg 2100 
caaactatcc agaatcgttt gcaactggag cgactgagaa ttccaattcc ccatggagag 2160 
aaaagaaaat cgctaatgtt gattactttg aactfcctgca tattcttgaa tttaaaaagg 2220 
ctgaaagagt aaaagattgt gctgaaatat tagagtataa acaaaatcgt gaaacaggcg 2280 
aaagaaagtt gtatcgagtg tggttttgta aatxscaggct ttgtccaktg tgcaactgga 2340 
ggagagcaat gaaacatggc attcagtcac aaaaggttgt tgctgaagtt attaaacaaa 2400 
agccaacagt tcgttggttg tttctcacat taacagt^taa aaatgtttat gatggcgaag 24 60 
aattaaataa gagtttgtca gatatggctc aaggatttcg ccgaatgatg caatataaaa 2520 
aaattaataa aaatcttgtt ggttttatgc gtgcaacgga agtgacaata aataataaag 2580 
ataattctta taatcagcac atgcatgtat tggtatgtgt ggaaccaact tattttaaga 2640 
atacagaaaa ctacgtgaat caaaaacaat ggatteaatt ttggaaaaag gcaatgaaat 2700 
tagactatga tccaaatgta aaagttcaaa tgattcgacc gaaaaataaa tataaatcgg 2760 
atatacaatc ggcaattgac gaaactgcaa aatatcctgt aaaggatacg gattttatga 2820 
ccgatgatga agaaaagaat ttgaaacgtt tgtctgattt ggaggaaggt ttacaccgta 2880 
aaaggttaat ctcctatggt ggtttgttaa aagaaataca taaaaaatta aaccttgatg 2940 
acacagaaga aggcgatttg attcatacag atgatgacga aaaagccgat gaagatggat 3000 
tttctattat tgcaatgtgg aattgggaac ggaaaaakta ttttattaaa gagtagttca 3060 
acaaacgggc catattgttg tataagtgat gaaatactga atttaaaact tagtttatat 3120 
gtggtaaaat gttttaatca agtttaggag gaattaatta tgaagtgtaa tgaatgtaac 3180 
agggttcaat taaaagaggg aagcgtatca ttaaccctat aaactacgtc tgccctcatt 3240 
attggagggt gaaatgtgaa tacatcctat t<:acaatcga atttacgaca caaccaaatt 3300 
ttaatttggc tttgcatttt atcttttttt agcgtattaa atgaaatggt tttgaacgtc 3360 
tcattacctg atattgcaaa tgattttaat aaaccacctg cgagtacaaa ctgggtgaac 3420 
acagccttta tgttaacctt ttccattgga acagctgtat atggaaagct atctgatcaa 3480 
ttaggcatca aaaggttact cctatttgga attataataa attgtttcgg gtcggtaatt 3540 
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gggtttgttg gccattcttt cttttcctta 
ggtgcagctg catttccagc actcgtaatg 
i aataggggta aagcatttgg tcttattgga 
ccagcgattg gtggaatgat agcccattat 
atgataacaa ttatcactgt tccgtttctt 
aaaggtcatt ttgatatcaa aggaattata 
ttgtttacaa catcatatag catttctttt 
tttgtaaaac atatcaggaa agtaacagat 
atacctttta tgattggagt tctttgtggg 
gtctctatgg ttccttatat gatgaaagat 
agtgtaatta ttttccctgg aacaatgagt 
cttgttgata gaagaggtcc tttatacgtg 
agctttttaa ctgcttcctt tcttttagaa 
gtatttgttt taggtgggct ttcgttca'cc 
agcttgaaac agcaggaagc tggtgctgga 
tcagagggaa caggtattgc aattgtaggt 
aggttgttac ctatggaagt tgatcagtca 
ttttcaggaa tcattgtcat tagttggctg 
agggatttct aaatcgttaa gggatcaact 
ttataacagt tcgaagcggc cgcaattctt 
ttttataggt taatgtcatg ataataatgg 
gaaatgtgcg cggaacccct atttgtttat 
tcatgagaca ataaccctga taaatgcttc 
ttcaacattt ccgtgtcgcc cttattccct 
ctcacccaga aacgctggtg aaagtaaaag 
gttacatcga actggatctc aacagcggta 
gttttccaat gatgagcact tttaaagttc 
acgccgggca agagcaactc ggtcgccgca 
actcaccagt cacagaaaag catcttacgg 



cttattatgg ctcgttttat tcaaggggct 3600 
gttgtagttg cgcgctatat tccaaaggaa 3660 
tcgatagtag ccatgggaga aggagtcggt 3720 

attcattggt cctatcttct actcattcct 3780 

i 

atgaaattat taaagaaaga agtaaggata 3840 
ctaatgtctg taggcattgt attttttatg 3900 
cttatcgtta gcgtgctgtc attcctgata 3960 
ccttttgttg atcccggatt agggaaaaat 4020 
ggaattatat ttggaacagt agcagggttt 4080 
gttcaccagc taagtactgc cgaaatcgga 4140 
gtcattattt tcggctacat tggtgggata 4200 
ttaaacatcg gagttacatt tctttctgtt 4260 
acaacatcat ggttcatgac aattataatc 4320 
aaaacagtta tatcaacaat tgtttcaagt 4380 ,'. 
atgagtttgc ttaactttac cagcttttta 4 4 40 
ggtttattat ccataccctt acttgatcaa 4500 
acttatctgt atagtaattt gttattacx:t 4560 . . 
gttaccttga atgtatataa acattctcaa 4620 
ttgggagaga gttcaaaatt gatccttttt 4€80 
gaagacgaaa gggcctcgtg atacgcctat 4 74 0 
tttcttagac gtcaggtggc acttttcggg 4800 
ttttctaaat acattcaaat atgtatccgc 4860 
aataatattg aaaaaggaag agtatgagta 4920 
tttttgcggc attttgcctt cctgtttttg 4 980 
atgctgaaga tcagttgggt gcacgagtgg 5040 
agatccttga gagttttcgc cccgaagaac 5100 
tgctatgtgg cgcggtatta tcccgtattg 5160 
tacactattc tcagaatgac ttggttgagt 5220 
atggcatgac agtaagagaa ttatgcagtg 5280 
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ctgccataac catgagtgat aacactgcgg ccaacttact -tctgacaacg atcggaggac 5340 
cgaaggagct aaccgctttt ttgcacaaca *gggggatca tgtaactcgc cttgatcgtt 5400 
gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgcag 54€0 
caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 5520 
aacaattaat agactggatg gaggcggata aagttgcagg accacttctg* cgctcggccc 5580 
ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 5640 
tcattgcagc actggggcca gatggtaagc cctcccgtat -cgtagttatc tacacgacgg 5700 
ggagtcaggc aactatggat gaacgaaata ^acagatcgc tgagataggt gcctcactga 5760 
ttaagcattg gtaactgtca gaccaagttt actcatatat acttta^att gatttaaaac 5620 
ttcattttta atttaaaagg atctaggtga agatcctttt tgataafcctc atgaccaaaa 5880 
tcccttaacg tgagttttcg ttccactgag .cgtcagaccc <:gtagaaaag atcaaaggat 5940 
cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 6000 
taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 60/60 , 
gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag -ttaggccacc 612C 
acttcaagaa ctctgtagca ccgcctacat acctcgctct -gctaatcctg ttaccagtgg 6180 
ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 6240 
ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 6300 
cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg 6360 
aagggagaaa ggcggacagg tatccggtaa gcggcagggt <:ggaacagga gagcgcacga 6420 
gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt -cgccacctct €4 80 
gacttgagcg tcgatttttg tgatgctcgt -caggggggcg -gagcctatgg aaaaacgcca 6540 
gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 6600 
ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 6660 
ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg ^gaagagcgcc -6720 
tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata tggtgcactc 6780 
tcagtacaat ctgctctgat gccgcatagt taagccagta *acactccgc tat<:gctacg "6840 
tgactgggtc atggctgcgc cccgacaccc gccaacaccc gctgacgcgc cctgacgggc 6900 
ttgtctgctc ccggcatccg cttacagaca agctgtgacc gtctccggga gctgcatgtg 6960 
tcagaggttt teaccgtcat caccgaaacg cgcgaggcag ctgcggtaaa gctcatcagc 7020 
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gtggtcgtga agcgattcac agatgtctgc ctgttcatcc gcgtccagct cgttgagttt 7080 
ctccagaagc gttaatgtct ggcttctgat aaagcgggcc atgttaaggg cggttttttc 7140 
' ctgtttggtc acttgatgcc tccgtgtaag ggggaatttc tgttcatggg ggtaatgata 7200 
ccgatgaaac gagagaggat gctcacgata cgggttactg atgatgaaca tgcccggtta 7260 
ctggaacgtt gtgagggtaa acaactggcg gtatggatgc ggcgggacca gagaaaaatc 7320 
actcagggtc aatgccagcg cttcgttaat acagatgtag gtgttccaca gggtagccag 7380 
cagcatcctg cgatgcagat ccggaacata atggtgcagg gcgctgactt ccgcgtttcc 74 40 
agactttacg aaacacggaa accgaagacc attcatgttg ttgctcaggt cgcagacgtt 7500 
ttgcagcagc agtcgcttca cgttcgctcg cgtatcggtg attcattctg ctaaccagta 7560 
aggcaacccc gccagcctag ccgggtcctc aacgacagga gcacgatcat gcgcacccgt 7620 
ggccaggacc caacgctgcc cgagatgcgc cgcgtgcggc tgctggagat ggcggacgcg 7680 
atggatatgt tctgccaagg gttggtttgc gcattcacag ttctccgcaa gaattgattg 7740 
gctccaattc ttggagtggt gaatccgtta gcgaggtgcc gccggcttcc attcaggtcg 7800 ; 
aggtggcccg gctccatgca ccgcgacgca acgcggggag gcagacaagg tatagggcgg 7860 \ 
cgcctacaat ccatgccaac ccgttccatg tgctcgccga ggcggcataa atcgccgtga 7 920 
cgatcagcgg tccagtgatc gaagttaggc tggtaagagc cgcgagcgat ccttgaagct 7980 
gtccctgatg gtcgtcatct acctgcctgg acagcatggc ctgcaacgcg ggcatcccga 8040 
tgccgccgga agcgagaaga atcataatgg ggaaggccat ccagcctcgc gtc ,8093 

<210> 79 
<211> 8098 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN429 plasmid 

<400> 79 

ggcggccgct tcgtcgaccg aaacagcagt tataaggcat gaagctgtcc ggtttttgca 60 

aaagtggctg tgactgtaaa aagaaatcga aaaagaccgt tttgtgtgaa aacggtcttt 120 

ttgtttcctt ttaaccaact gccataactc gaggcctacc tagcttccaa gaaagatatc 180 

ctaacagcac aagagcggaa agatgttttg ttctacatcc agaacaacct ctgctaaaat 240 

tcctgaaaaa ttttgcaaaa agttgttgac tttatctaca aggtgtggta taataatctt 300 
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aacaacagca ggacgctcta gattagaaag gaggtttaat taatgtatcg tacgatgatg 360 
agcggcaaac ttcacagggc aactgttacg gaagcaaacc tgaactatgt -gggaagcatt 420 
1 acaattgatg aagatctcat tgatgctgtg ggaatgcttc -ctaatgaaaa agtacaaatt 430 
gtgaataata ataatggagc acgtcttgaa acgtatatta ttcctggtaa acggggaagc "540' 
ggcgtcatat gcttaaacgg tgcagccgca cgccttgtgc aggaaggaga taaggtcatt "600 ' 
attatttcct acaaaatgat cftctgatcaa gaagcggcaa gccatgagcc gaaagtggct 660 
gttctgaatg atcaaaacaa aattgaacaa atgctgggga acgaaccagc ccgtacaatt 720 
ttgtaaagga tcctgttttg gcggatgaga gaagattttc agcctgatac agattaaatc 780 
agaacgcaga agcggtctga taaaacagaa tttgcctggc ggcagtagcg cggtggtccc 840 
acctgacccc atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc 900 
tccccatgcg agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag 960 
actgggcctt tcgttttatc tgttgtttgt cggtgaacgc tctoctgagt aggacaaatc 1020 
cgccgggagc ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc 1O80 
cgccataaac tgccaggcat caaattaagc agaaggccat -cctgacggat ggcctttttg 1140 ; 
cgtttctaca aactcttggt acccagaaaa agcggcaaaa gcggctgtta aaaaagcgaa 1200 . 
atcgaagaag ctgtctgccg ctaagacgga atatcaaaag cgttctgctg ttgtgtcatc 12,60 
tttaaaagtc acagccgatg aat-cccagca agatgtccta aaatacttga acacccagaa 1320 
agataaagga aatgcagacc aaattcattc ttattatgtg gtgaacggga ttgctgttca 1380 
tgcctcaaaa gaggttatgg aaaaagtggt gcagtttocc gaagtggaaa aggtgcttcc 1440 
taatgagaaa cggcagcttt ttaagtcatc ctccccattt aatatgaaaa aagcacagaa 1'500 
agctattaaa gcaactgacg gtgtggaatg gaatgtagac -caaat-cgatg ccccaaaagc 1560 
' ttgggcactt ggatatgatg gaactggcac ggttgttgcg tccattgata -ccggggtgga 1620 
atggaatcat ccggcattaa aagagaaata tcgeggatat aatccggaaa atcctaatga 1680 
gcctgaaaat gaaatgaact ggtatgatgc cgtagcaggc gaggcaagcc cttatgatga 1740 
tttggctcat ggaacccacg tgacaggcac -gatggtgggc tctgaacctg atggaacaaa 1€00 
tcaaatcggt gtagcacctg gcgcaaaatg gattgctgtt aaagcgttct ctgaagatgg 1860 
cggcactgat gctgacattt tggaagctgg tgaatgggtt ttagcaccaa aggacgcgga 1920 
aggaaatccc cacccggaaa tggctcctga tgttgtcaat aacfccatggg gagggggctc 1980 
tggacttgat gaatggtaca gagacatggt caatgcctgg cgttcggccg atattttccc -2040 
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tgagttttca gcggggaata cggatctctt 
tccggcaaac tatccagaat c'gtttgcaac 
gagagaaaag aaaatcgcta atgttgatta 
aaaggctgaa agagtaaaag attgtgctga 
aggcgaaaga aagttgtatc gagtgtggtt 
ctggaggaga gcaatgaaac atggcattca 
acaaaagcca acagttcgtt ggttgtttct 
cgaagaatta aataagagtt tgtcagatat 
taaaaaaatt aataaaaatc ttgttggttt 
taaagataat tcttataatc agcacatgca 
taagaataca gaaaactacg tgaatcaaaa 
gaaattagac tatgatccaa atgtaaaagt 
atcggatata caatcggcaa ttgacgaaac 
tatgaccgat gatgaagaaa agaatttgaa 
ccgtaaaagg ttaatctcct atggtggttt 
tgatgacaca gaagaaggcg atttgattca 
tggattttct attattgcaa tgtggaattg 
gttcaacaaa cgggccatat tgttgtataa 
tatatgtggt aaaatgtttt aatcaagttt 
gtaacagggt tcaattaaaa gagggaagcg 
tcattattgg agggtgaaat gtgaatacat 
aaattttaat ttggctttgc attttatctt 
acgtctcatt acctgatatt gcaaatgatt 
tgaacacagc ctttatgtta accttttcca 
atcaattagg catcaaaagg ttactcctat 
taattgggtt tgttggccat tctttctttt 
gggctggtgc agctgcattt ccagcactcg 
aggaaaatag gggtaaagca tttggtctta 
tcggtccagc gattggtgga atgatagccc 
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tattcccggc 


gggcctggtt ctatcgcaaa 


2100 


tggagcgact 


gagaattcca attccccatg 


2160 


ctttgaactt 


ctgcatattc ttgaatttaa 


2220 


aatattagag 


tataaacaaa atcgtgaaac 


2280 


ttgtaaatcc 


aggctttgtc caatgtgcaa 


2340 


gtcacaaaag 


gttgttgctg aagttattaa 


2400 


cacattaaca 


gttaaaaatg tttatgatgg 


2460 


ggctcaagga 


tttcgccgaa tgatgcaata 


2520 


tatgcgtgca 


acggaagtga caataaataa 


2580 


tgtattggta 


tgtgtggaac caacttattt 


2640 


acaatggatt 


caattttgga aaaaggcaat 


2700 


tcaaatgatt 


cgaccgaaaa ataaatataa 


2760 


tgcaaaatat 


cctgtaaagg atacggattt 


2820 


acgtttgtct 


gatttggagg aaggthtaca 


2880 


gttaaaagaa 


atacataaaa aattaaacct 


2940 


tacagatgat 


gacgaaaaag ccgatgaaga 


3000 


ggaacggaaa 


aattatttta ttaaagagta 


3060 


gtgatgaaat 


actgaattta aaacttagtt 


3120 


aggaggaatt 


aattatgaag tgtaatgaat 


3180 


tatcattaac 


cctataaact acgtctgccc 


3240 


cctattcaca 


atcgaattta cgacacaacc 


3300 


tttttagcgt 


attaaatgaa atggttttga 


3360 


ttaataaacc 


acctgcgagt acaaactggg 


3420 


ttggaacagc 


tgtatatgga aagctatctg 


3480 


ttggaattat 


aataaattgt ttcgggtcgg 


3540 


ccttacttat 


tatggctcgt tttattcaag 


3600 


taatggttgt 


agttgcgcgc tatattccaa 


3660 


ttggatcgat 


agtagccatg ggagaaggag 


3720 


attatattca 


ttggtcctat cttctactca 


3780 
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ttcctatgat aacaattatc actgttccgt ttcttat^aa attattaaag aaagaagtaa 3840 
ggataaaagg tcattttgat atcaaaggaa ttatactaa* gtctgtaggc attgtatttt 3900 
' ttatgttgtt tacaacatca tatagcattt cttttcttat -cgttagcgtg -cbgtcattcc 3960 
tgatatttgt aaaacatatc aggaaagtaa cagatccttt tgttgatccc ggattaggga 4020 
aaaatatacc ttttatgatt ggagttcttt gtgggggaat tatatttgga acagtagcag 4080 
ggtttgtctc tatggttcct tatatgatga aagatgttca ccagctaagt actgccgaaa 4140 
tcggaagtgt aattattttc cctggaacaa tgagtgtcat tattttcggc tacattggtg 4200 
ggatacttgt tgatagaaga ggtcctttat acgtgttaaa catcggagtt acatttcttt 42^0 
ctgttagctt tttaactgct tcctttcttt tagaaacaac atcatggttc atgacaat-ta 4320 
taatcgtatt tgttttaggt gggctttcgt tcaccaaaac agttatatca acaattgttt 4 380 
caagtagctt gaaacagcag gaagctggtg ctggaatgag tttgcttaac fcttaccagct 4 4 40 

» 

ttttatcaga gggaacaggt att^gcaattg taggtggttt attatccata <cccttactt<g 4 500 
atcaaaggtt gttacctatg gaagttgatc agtcaactta totgtatagt aatttgttat 4560 
tacttttttc aggaatcatt gtcattagtt -ggctggttac cttgaatgta tataaacatt 4620 
ctcaaaggga tttctaaatc gttaagggat caactttggg agagagtcca aaattgatcc 4680 
tttttttata acagttcgaa gcggccgcaa ttcttgaaga cgaaagggcc -tcgtgatacg 4 7 40 
cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag -gtggcacttt 4B0O 
tcggggaaat gtgcgcggaa cccctatttg tttattttto taaatacatt -caaatatgta 48 60 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa <ggaagagtat 4920 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt .gccttcctgt 4980 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 5040 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 5100 
agaacgtttt ccaatgatga gcacttttaa agttctgota tgtggcgcgg tattat-cccg 5160 
tattgacgcc gggcaagagc aactcggtcg ccgcataca-c *atbctcaga atgacttggt 5220 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 5280 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 5340 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 5400 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 54 60 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta cbctagcttc 5520 



rnsdocio <wo 
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ccggcaacaa ttaatagact ggatggaggc 
ggcccttccg gctggctggt ttattgctga 
cggtatcatt gcagcadtgg ggccagatgg 
gacggggagt caggcaacta tggatgaacg 
actgattaag cattggtaac tgtcagacca 
aaaacttcat ttttaattta aaaggatcta 
caaaatccct taacgtgagt tttcgttcea 
aggatcttct tgagatcctt tttttctgcg 
accgctacca gcggtggttt gtttgccgga 
aactggcttc agcagagcgc agataccaaa 
ccaccacttc aagaactctg tagcaccgcc 
agtggctgct gccagtggcg ataagtcgtg 
accggataag gcgcagcggt cgggctgaac 
gcgaacgacc fcacaccgaac tgagatacct 
tcccgaaggg agaaaggcgg acaggtatcc 
cacgagggag cttccagggg gaaacgcctg 
cctctgactt gagcgtcgat ttttgtgatg 
cgccagcaac gcggcctttt tacggttcct 
ctttcctgcg ttatcccctg attctgtgga 
taccgctcgc cgcagccgaa cgaccgagcg 
gcgcctgatg cggtattttc tccttacgca 
cactctcagt acaatctgct ctgatgccgc 
ctacgtgact gggtcatggc tgcgccccga 
cgggcttgtc tgctcccggc atccgcttac 
atgtgtcaga ggttttcacc gtcatcaccg 
tcagcgtggt cgtgaagcga ttcacagatg 
agtttctcca gaagcgttaa tgtctggctt 
ttttcctgtt tggtcacttg atgcctccgt 
tgataccgat gaaacgagag aggatgctca 
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ggataaagtt gcaggaccac ttctgcgctc 5580 
taaatctgga gccggtgagc gtgggtctcg 5640 
taagccctcc cgtatcgtag ttatctacac 5700 
aaatagacag atcgctgaga taggtgcctc 5760 
agtttactca tatatacttt agattgattt 5820 
ggtgaagatc ctttttgata atctcatgac 5880 
ctgagcgtca gaccccgtag aaaagatcaa 5940 
cgtaatctgc tgcttgcaaa caaaaaaacc 6000 
tcaagagcta ccaactcttt ttccgaaggt 6060 
tactgtcctt ctagtgtagc cgtagttagg 6120 
tacatacctc gctctgctaa tcctgttacc 6180 
tcttaccggg ttggactcaa gacgatagtt 6240 
ggggggttcg tgcacacagc ccagcttgga 6300 
acagcgtgag ctatgagaaa gcgccacgct 6360 
ggtaagcggc agggtcggaa caggagagcg 6420 
gtatctttat agtcctgtcg ggtttcgcca 6480 
ctcgtcaggg gggcggagcc tatggaaaaa 6540 
ggccttttgc tggccttttg ctcacatgtt 6600 
taaccgtatt accgcctttg agtgagctga 6660 
cagcgagtca gtgagcgagg aagcggaaga 6720 
tctgtgcggt atttcacacc gcatatggtg 6780 
atagttaagc cagtatacac tccgctatcg 6840 
cacccgccaa cacccgctga cgcgccctga 6900 
agacaagctg tgaccgtctc cgggagctgc 6960 
aaacgcgcga ggcagctgcg gtaaagctca 7020 
tctgcctgtt catccgcgtc cagctcgttg 7080 
ctgataaagc gggccatgtt aagggcggtt 7140 
gtaaggggga atttctgttc atgggggtaa 7200 
cgatacgggt tactgatgat gaacatgccc 7260 



WO 01/021772 PCT/USOO/25993 

- no- 

ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatigcggcgg gaccagagaa 7320 
aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta 7380 
gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gactfcccgcg 7440 
tttccagact ttacgaaaca cggaaaccga agaccattca -tgttgttgct caggtcgcag 7500' 
acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac 7560 
cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca 7620 
cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg 7680 
acgcgatgga tatgttctgc caagggttgg tttgcgcatt -cacagttctc cgcaagaatt 77 40 
gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg -cttccattca 7800 

ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga -caaggtatag 7860 

i 

, ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc <gccgaggcgg cataaatcgc 7920 
cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg 7980 

. aagctgtccc tgatggtcgt catctacctg cct-ggacagc atggcctgca acgcgggcat 8040 , 
cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtc 8098 

<210> -80 
<211> 4450 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN4 4 3 plasmid 

<400> 80 

ccatcgaatg gccagatgat taattcctaa tttttgttga cactctatca ttgatag;agt 60 
tattttacca ctccctatca gtgatagaga aaagtgaaat gaatagttcg acaaaaatct 120 
agattagaaa ggaggattta aatatgagac agattact^ga *atttcacag ctgaaagaag 180 
ccataaaaca ataccattca gagggcaagt caatcggatt tgttccgacg atggggtttc 240 
tgcatgaggg gcatttaacc ttagcagaca aagcaag;aca agaaaacgac gccgttatta 300 
tgagtatttt tgtgaatcct gcacaattcg gccctaatga agattttgaa gcatatccgc 360 
gcgatattga gcgggatgca gctcttgcag aaaacgccgg agtcgatatt ctttttacgc 420 
cagatgctca tgatatgtat cccggtgaaa agaatgtcac gattcatgta gaaagacgca 480 
cagacgtgtt atgcgggcgc tcaagagaag gacattttga cggggtcgcg atcgtactga 540 
cgaagctttt caatctagtc aagccgactc -gt^cctattt cggtttaaaa -gatgcgcagc 600 
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aggtagctgt tgttgatggg ttaatcagcg 
tcgatacggt cagagaggaa gacggcttag 
ctgaggaaag aaaagaagcg cctaagctgt 
tccaagccgg tgaaagagat cctgaagcgg 
cgactagcgg aaccatagac tatgtagagc 
atgaaattgc tggaaagatg attctcgctg 
tagataatat cattattgat attcgtagaa 
atgagcggca aacttcacag ggcaactgtt 
attacaattg atgaagatct cattgatgct 
attgtgaata ataataatgg agcacgtctt 
agcggcgtca tatgcttaaa cggtgcagcc 
attattattt cctacaaaat gatgtctgat 
gctgttctga atgatcaaaa caaaattgaa 
attttgtaaa ggatcccccg gggatccctc 
agcgcttgga gccacccgca gttcgaaaaa 
gcgcacattg tgcgacattt tttttgtctg 
acgcgccctg tagcggcgca ttaagcgcgg 
ctacacttgc cagcgcccta gcgcccgctc 
cgttcgccgg ctttccccgt caagctctaa 
gtgctttacg gcacctcgac cccaaaaaac 
catcgccctg atagacggtt tttcgccctt 
gactcttgtt ccaaactgga acaacactca 
aagggatttt gccgatttcg gcctattggt 
acgcgaattt taacaaaata ttaacgctta 
tgcgcggaac ccctatttgt ttatttttct 
gacaataacc ctgataaatg cttcaataat 
atttccgtgt cgcccttatt cccttttttg 
cagaaacgct ggtgaaagta aaagatgctg 
tcgaactgga tctcaacagc ggtaagatcc 



-111- 

acttcttcat ggatattgaa ttggttcctg 660 

ccaaaagctc tcgcaatgta tacttaacag 720 

atcgggccct tcaaacaagt gcggaacttg 780 

tgataaaagc tgcaaaagat atcattgaaa 840 

tttattccta tccggaactc gagcctgtga 900 

ttgcagttgc tttttcaaaa gcgcgtttaa 960 

aggaggtgaa ttaatatgta tcgtacgatg 1020 

acggaagcaa acctgaacta tgtgggaagc 1080 

gtgggaatgc ttcctaatga aaaagtacaa 1140 
t 

gaaacgtata ttattcctgg taaacgggga 1200 
gcacgccttg tgcaggaagg agataaggtc 1260 
caagaagcgg caagccatga gccgaaagtg 1320 
caaatgctgg ggaacgaacc agcccgtaca 1380 
gaggtcgacc tgcaggggga ccatggtctc 1440 
taataagctt gacctgtgaa gtgaaaaatg 1500 
ccgtttaccg ctactgcgtc acggatctcc 1560 
cgggtgtggt ggttacgcgc agcgtgaccg 1620 
ctttcgcttt cttcccttcc tttctcgcca 1680 
atcgggggct ccctttaggg ttccgattta 1740 
ttgattaggg tgatggttca cgtagtgggc 1800 
tgacgttgga gtccacgttc tttaatagtg 1860 
accctatctc ggtctattct tttgatttat 1920 
taaaaaatga gctgatttaa caaaaattta 1980 
caatttcagg tggcactttt cggggaaatg 2040 
aaatacattc aaatatgtat ccgctcatga 2100 
attgaaaaag gaagagtatg agtattcaac 2160 
cggcattttg ccttcctgtt tttgctcacc 2220 
aagatcagtt gggtgcacga gtgggttaca 2280 
ttgagagttt tcgccccgaa gaacgttttc 2340 
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» 

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 2400 
ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 24^60 
cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 2520 
taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 2580 
agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 2640 
cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 2700 
caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaaoaat 2760 
tgatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 2620 
ctggctggtt tattgctgat aaatctggag ccggtgagcg tggctctcgc ggtatcattg 2880 
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 294*0 
aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgat^aagc 3000 
attggtagga attaatgatg tctcgtttag ataaaagtaa agtgattaac agcgcattag 3060 . 
agctgcttaa tgaggtcgga atcgaaggtt taacaacccg taaactcgcc cagaagctag 3120- . < 
gtgtagagca gcctacattg tattggcatg taaaaaataa gcgggctttg ctcgacgcct 31*80 . 
tagccattga gatgttagat aggcaccata ctcacttttg ccctttagaa ggggaaag-ct 324*0 
ggcaagattt tttacgtaat aacgctaaaa gttttagatg tgctttacta agtcat-cgcg 3300 , 
atggagcaaa agtacattta ggtacacggc ctacagaaaa acagtatgaa actct<:gaaa 3360 . 
atcaattagc ctttttatgc caacaaggtt tttcactaga gaatgcatta -tatgcactca 3420 
gcgcagtggg gcattttact ttaggttgcg tattggaaga tcaagagcat caagtcgcta 3480 
aagaagaaag ggaaacacct actactgata gtatgccgcc attattacga caagctatcg 3540 
aattatttga tcaccaaggt gcagagccag ccttcttatt cggccttgaa ttgatcatat 3600 
gcggattaga aaaacaactt aaatgtgaaa gtgggtctta aaagcagcat aacctttttc 3660 
cgtgatggta acttcactag tttaaaagga tctaggtgaa gatccttttt gataatctca 3720 
tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga 3780 
tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa 3840 
aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga 3900 
aggtaactgg cttcagcaga gcgcagatac -caaatactgt ccttctagtg tagccgtagt 3960 
taggccacca cttcaagaac tctgtagcac -cgcctacata cct-cgctctg ctaatcctgt 4020 
taccagtggc tgctgccagt ggcgataagt -cgtgtcttac cgggttggac t-caagacgat 4080 



OMervvirv ,wn 
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agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct 4140 
tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca 4200 
cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 4^60 
agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtpct gtcgggtttc 4320 
gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga 4380 
aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca 4440 
tgacccgaca 4450 

<210> 81 
<211> 10212 

<212> DNA ' 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN251 plasmid 

<400> 81 

gcggccgcta aaaagagctt gaggatttgc ggagtgaaaa tcagacattg cggaatcagc 60 ■ 
tagagatgac agaagaggat tacaaggcac tgatcgatat catggatcgg gccagaaaaa 120* 
tggttgtttc gaaggaagac ggaagaatga aaaaagcggc tcaagaaacg taaagaaacg 180 ■ 
cctgaaatga accggcccta tagtaagaat aggccggttg ttttgatttc tatgcagact 240 \ 
ctcccggtgt catttcgcga tccatatcag gatgccagat gagcgggtct tcccctttgt 300 - 
cccgcgccat atcatactta acagttttaa agttcatttt gttccaaaat tccgctgatt 360 
tcattctcgg atttgtccgg atcggcattt ^gaatgattt tgcaaattca accaaggctc 420 
tcccgtatcc cctgttctgg tagcccggaa gaacctcaag cttccacagc tccaaataat 480 
cctggcggtt gtcaaaatag ggattcgatt tgccgttaac ttgatacaga ctcattcgtg 540 
ctacaagttt atcgccaaaa taaatcccgt aaaaaggcga ggtgctgtca ttttcaataa 600 
tattatcctg aagttcttca agcattgaaa gctcctgaat gccgtattct ttgaatttct 660 
tgaattcttc cagcgtttta tagttgataa gcagacgttc tacctttgtc aaacaaatct 720 
ccccctttgt tgtttctaca tatattgtaa acgctttatt taaaaaatcc aaatatttaa 780 
actttaattt taagcacatg ggatctttga gaagtaattt cttcttactt ctgctatgat 840 
aatacgtaaa tgcgtcgacc gaaacagcag ttataaggca tgaagctgtc cggtttttgc 900 
aaaagtggct gtgactgtaa aaagaaatcg aaaaagaccg ttttgtgtga aaacggtctt 960 
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tttgtttcct tttaaccaac tgccataact cgaggcctac ctagcttcca agaaagatat 1020 
cctaacagca caagagcgga aagatgtttt gttctacatc cagaacaacc tctgctaaaa 1080 
1 ttcctgaaaa attttgcaaa aagttgttga ctttatctac aaggtgtggt ataataatct 1140 
taacaacagc aggacgctct agaggaggag acatcatgaa aattggaatt atcggcggag 1200 
gctccgttgg tcttttatgc gcctattatt tgtcacttta tcacgacgtg actgttgtga 1260 
cgaggcggca agaacaggct gcggccattc agt-ctgaagg aatccggctt tataaaggcg 1320 
gggaggaatt cagggctgat tgcagtgcgg acacgagtat caattcggac tttgacctgc 1380 
ttgtcgtgac agtgaagcag catcagcttc aatctgtttt ttcgtcgctt gaacgaatcg 1440 
ggaagacgaa tatattattt ttgcaaaacg gcatgigggca tatccacgac ctaaaagact 1*500 
ggcacgttgg ccattccatt tatgttggaa tcgttgagca cggagctgta agaaaatcgg 1560 
atacagctgt tgatcataca ggcctaggtg cgataaaatg gagcgcgttc .gacgatgctg 1620 
aaccagaccg gctgaacatc ttgttt-cagc ataaccattc ggattttccg atttat-tatg 1680 
, agacggattg gtaccgtctg ctgacgggca agctgattgt aaatgcgtgt attaatcctt 17 40 
taactgcgtt attgcaagtg aaaaatggag ' aactgctgac aacgccagct tatctggctt 1-800 
ttatgaagct ggtatttcag gaggcatgcc gcattttaaa acttgaaaat gaagaaaagg 1860 
cttgggagcg ggttcaggcc gtttgtgggc aaacgaaaga gaat-cgttca tcaatgctgg 1920 
ttgacgtcat tggaggccgg cagacggaag ot^gacgccat tatcggatac ttattgaagg 1980 
aagcaagtct tcaaggtctt gatgccgtcc acctagagtt tttatatggc agcatcaaag 2040 
cattggagcg aaatacaaac aaagtctttt gagctttttc ggtaacatgc tatact-catt 2100 
tcggatcact aactatttat tggagaaagg aagttctaga agatgcagct aactgaactt 2160 
tccatcaaaa atcagaatgt gtttgtacag cactatatag atggcaaaga agaaatgtct 2220 
tctttttttg attacagtat tcat-cataag gacatgtggc gcgaaagact ggaagactta 2280 
tcttcccggt ttttcgcaag agaggaattg gcggcgtact taacctctta ccataataaa 2340 
ttcggttcaa gtgcgatgca gtctgctatt gagaagctga aggacccgtc aagtgccgct 2400 
gtagtcggcg gacagcaggc aggactttta acaggaccgc tttacaccat acataaaatc 2460 
atttcaatca ttgttttagc aaagcaacaa gaaaaggaac tgcaagtgcc tgtcatacca 2520 
atcttctggg tggctggaga agaccacgat ttggatgaga ttaattttgt tcacacatct 2580 
gaagagaatg ggcctgtgaa aaaaaagctg cctcagtctt attggaagaa atcatcagca 2640 
gcgagtacat cgcttgatca ggaaaagtgt gccgcgtgga tagatgatgt ttttgccgct 2700 
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tttgaagaaa cagaccatac gaatacactt 
tctgttacgt ttactgactt ctttgaactg 
ttagttttat taaattctgg ggatcctgtt 
tacagattaa atcagaacgc agaagcggtc 
gcgcggtggt cccacctgac cccatgccga 
gtagtgtggg gtctccccat gcgagagtag 
gctcagtcga aagactgggc ctttcgtttt 
agtaggacaa atccgccggg agcggatttg 
cgggcaggac gcccgccata aactgccagg 
gatggccttt ttgcgtttct acaaactctt 
ttaaaaaagc gaaatcgaag aagctgtctg 
ctgttgtgtc atctttaaaa gtcacagccg 
tgaacaccca gaaagataaa ggaaatgcag 
ggattgctgt tcatgcctca aaagaggtta 
aaaaggtgct tcctaatgag aaacggcagc 
aaaaagcaca gaaagctatt aaagcaactg 
atgccccaaa agcttgggca cttggatatg 
ataccggggt ggaatggaat catccggcat 
aaaatcctaa tgagcctgaa aatgaaatga 
gcccttatga tgatttggct catggaaccc 
ctgatggaac aaatcaaatc ggtgtagcac 
tctctgaaga tggcggcact gatgctgaca 
caaaggacgc ggaaggaaat ccccacccgg 
ggggaggggg ctctggactt gatgaatggt 
ccgatatttt ccctgagttt tcagcgggga 
gttctatcgc aaatccggca aactatccag 
ccaattcccc atggagagaa aagaaaatcg 
ttcttgaatt taaaaaggct gaaagagtaa 
aaaatcgtga aacaggcgaa agaaagttgt 
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ctcgacaatg tgaaacgatg tttaagggaa 27 60 
ctgatcgcgg atttgttcca agaagagggc 2820 
ttggcggatg agagaagatt ttcagcctga 2880 
tgataaaaca gaatttgcct ggcggcagta 2940 
actcagaagt gaaacgccgt agcgccgatg 3000 
ggaactgcca ggcatcaaat aaaacgaaag 3060 
atctgttgtt tgtcggtgaa cgctctcctg 3120 
aacgttgcga agcaacggcc cggagggtgg 3180 
catcaaatta agcagaaggc catcctgacg 3240 
ggtacccaga aaaagcggca aaagcggctg 3300 
ccgctaagac ggaatatcaa aagcgttctg 3360 
atgaatccca gcaagatgtc ctaaaatact 3420. 
accaaattca ttcttattat gtggtgaacg 3480- 
tggaaaaagt ggtgcagttt cccgaagtgg 3540 
tttttaagtc atcctcccca tttaatatga 3600 
acggtgtgga atggaatgta gaccaaatcg 3660' 
atggaactgg cacggttgtt gcgtccattg 3720 
taaaagagaa atatcgcgga tataatccgg 3780 
actggtatga tgccgtagca ggcgaggcaa 3840 
acgtgacagg cacgatggtg ggctctgaac 3900 
ctggcgcaaa atggattgct gttaaagcgt 3960 
ttttggaagc tggtgaatgg gttttagcac 4020 
aaatggctcc tgatgttgtc aataactcat 4080 
acagagacat ggtcaat^cc tggcgttcgg 4140 
atacggatct ctttattccc ggcgggcctg 4200 
aatcgtttgc aactggagcg actgagaatt 4260 
ctaatgttga ttactttgaa cttctgcata 4320 
aagattgtgc tgaaatatta gagtataaac 4380 
atcgagtgtg gttttgtaaa tccaggcttt 4440 
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gtccaatgtg caactggagg agagcaatga aacatggcat tcagtcacaa aaggttgttg 4500 
ctgaagttat taaacaaaag ccaacagttc -gttggttgtt tctcacatta acagttaaaa 4560 
atgtttatga tggcgaagaa ttaaataaga gtttgtcaga *atggctcaa ggattt<:gcc 4^620 
gaatgatgca atataaaaaa attaataaaa atcttgttgg ttttatgcgt gcaacggaag 4*680 
tgacaataaa taataaagat aattcttata atcagcacat gcatgtattg gtatgtgtgg 4740 
aaccaactta ttttaagaat acagaaaact acgtgaatca aaaacaatgg attcaatttt 4€00 
ggaaaaaggc aatgaaatta gactatgatc caaatgtaaa agtteaaatg attcgaccga 48-60 
aaaataaata taaatcggat atacaatcgg caattgacga aactgcaaaa tatcctgtaa 4 920 
aggatacgga ttttatgacc gatgatgaag aaaagaattt gaaacgtttg tctgatttgg 4980 
aggaaggttt acaccgtaaa aggttaatct cctatggtgg tttgttaaaa gaaatacata 5040 
aaaaattaaa ccttgatgac acagaagaag gcgatttgat tcatacagat gatgacgaaa 5100 
aagccgatga agatggattt tctattattg caatgtggaa ttgggaacgg aaaaattatt 5160 
ttattaaaga gtagttcaac aaacgggcca tattgttgta taagtgatga aatactgaat 5220 
ttaaaactta gtttatatgt ggtaaaatgt tttaatcaag tttaggagga attaattatg 5280 
aagtgtaatg aatgtaacag ggttcaatta aaagagggaa gcgtatcatt eaccctataa 5340 
actacgtctg ccctcattat tggagggtga aatgtgaata catcctattc acaatcgaat 5400 
ttacgacaca accaaatttt aatttggctt tgcattttat -ctttttttag -cgtattaaat 5460 
gaaatggttt tgaacgtctc attacctgat attgcaaatg attttaataa accacctgcg 5520 
agtacaaact gggtgaacac agcctttatg ttaacctttt -ccattggaac agctgtatat 55*0 
ggaaagctat ctgatcaatt aggcatcaaa aggttactcc tatttggaat *ataataaat 5640 
tgtttcgggt cggtaattgg gtttgttggc .cattctttct tttccttact tattatggct 5-700 
cgttttattc aaggggctgg tgcagctgca tttccagcac tcgtaatggt -tgtagttgcg 5760 
cgctatattc caaaggaaaa taggggtaaa gcatttggtc ttattggatc gatagtagcc 5€20 
atgggagaag gagtcggtcc agcgattggt ggaatgatag .cccattatat tcattggtcc 5880 
tatcttctac tcattcctat gataacaatt atcactgttc cgtttcttat gaaattatta 5940 
aagaaagaag taaggataaa aggtcatttt gatatcaaag gaattatact aatgtctgta 6O00 
ggcattgtat tttttatgtt gtttacaaca tcatatagca tttcttttct tatcgttagc 6060 
gtgctgtcat tcctgatatt tgtaaaacat atcaggaaag taacagatcc ttttgtt^gat 6120 
cccggattag ggaaaaatat accttttatg attggagttc tttgtggggg aattatattt €180 
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ggaacagtag 


cagggtttgt 


ctctatggtt 


ccttatatga 


tgaaagatgt 


tcaccagcta 


6240 


agtactgccg 


aaatcggaag 


tgtaattatt 


ttccctggaa 


caatgagtgt 


cattattttc 


6300 


ggctacattg 


gtgggatact 


tgttgataga 


agaggtcctt 


tatacgtgtt 


aaacatcgga 


6360 


gttacatttc 


tttctgttag 


ctttttaact 


gcttcctttc 


ttttagaaac 


aacatcatgg 


6420 


ttcatgacaa 


ttataatcgt 


atttgtttta 


ggtgggcttt 


cgttcaccaa' 


aacagttata 


6480 


tcaacaattg 


tttcaagtag 


cttgaaacag 


caggaagctg 


gtgctggaat 


gagtttgctt 


6540 


aactttacca 


gctttttatc 


agagggaaca 


ggtattgcaa 


ttgtaggtgg 


tttattatcc 


6600 


atacccttac 


ttgatcaaag 


gttgttacct 


atggaagttg 


atcagtcaac 


ttatctgtat 


6660 


agtaatttgt 


tattactttt 


ttcaggaatc 


attgtcatta 


gttggctggt 


taccttgaat 


6720 


gtatataaac 


attctcaaag 


ggatttctaa 


atcgttaagg 


gatcaacttt 


gggagagagt 


6780 


tcaaaattga 


tccttttttt 


ataacagttc 


gaagcggccg 


caat)bcttga 


agacgaaagg 


6840 


gcctcgtgat 


acgcctattt 


ttataggtta 


atgtcatgat 


aataatggtt 


tcttagacgt 


6900 


caggtggcac 


ttttcgggga 


aatgtgcgcg 


gaacccctat 


ttgtttattt 


ttctaaatac 


6960 


attcaaatat 


gtatccgctc 


atgagacaat 


aaccctgata 


aatgcttcaa 


taatattgaa 


702Q 


aaaggaagag 


tatgagtatt 


caacatttcc 


gtgtcgccct 


tattcccttt 


tttgcggcat 


7080 


tttgccttcc 


tgtttttgct 


cacccagaaa 


cgctggtgaa 


agtaaaagat 


gctgaagatc 


7140 


agttgggtgc 


acgagtgggt 


tacatcgaac 


tggatctcaa 


cagcggtaag 


atccttgaga 


7200 


gttttcgccc 


cgaagaacgt 


tttccaatga 


tgagcacttt 


taaagttctg 


ctatgtggcg 


7260 


cggtattatc 


ccgtattgac 


gccgggcaag 


agcaactcgg 


tcgccgcata 


cactattctc 


7320 


agaatgactt 


ggttgagtac 


tcaccagtca 


cagaaaagca 


tcttacggat 


ggcatgacag 


7380 


taagagaatt 


atgcagtgct 


gccataacca 


tgagtgataa 


cactgcggcc 


aacttacttc 


7440 


tgacaacgat 


cggaggaccg 


aaggagctaa 


ccgctttttt 


gcacaacatg 


ggggatcatg 


7500 


taactcgcct 


tgatcgttgg 


gaaccggagc 


tgaatgaagc 


cataccaaac 


gacgagcgtg 


7560 


acaccacgat 


gcctgcagca 


atggcaacaa 


cgttgcgcaa 


actattaact 


ggcgaactac 


7620 


ttactctagc 


ttcccggcaa 


caattaatag 


actggatgga 


ggcggataaa 


gttgcaggac 


7680 


cacttctgcg 


ctcggccctt 


ccggctggct 


ggtttattgc 


tgataaatct 


ggagccggtg 


7740 


agcgtgggtc 


tcgcggtatc 


attgcagcac 


tggggccaga 


tggtaagccc 


tcccgtatcg 


7800 


tagttatcta 


cacgacgggg 


agtcaggcaa 


ctatggatga 


acgaaataga 


cagatcgctg 


7860 


agataggtgc 


ctcactgatt 


aagcattggt 


aactgtcaga 


ccaagtttac 


tcatatatac 


7920 
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tttagattga tttaaaactt catttttaat 
ataatctcat gaccaaaatc ccttaacgtg 
tagaaaagat caaaggatct tcttgagatc 
aaacaaaaaa accaccgcta ccagcggtgg 
tttttccgaa ggtaactggc ttcagcagag 
agccgtagtt aggccaccac ttcaagaact 
taatcctgtt accagtggct gctgccagtg 
caagacgata gttaccggat aaggcgcagc 
agcccagctt ggagcgaacg acctacaccg 
aaagcgccac gcttcccgaa gggagaaagg 
gaacaggaga gcgcacgagg gagcttccag 
tcgggtttcg ccacctctga cttgagcgtc 
gcctatggaa aaacgccagc aacgcggcct 
ttgctcacat gttctttcct gcgttatccc 
ttgagtgagc tgataccgct cgccgcagcc 
aggaagcgga agagcgcctg atgcggtatt 
accgcatatg gtgcactctc agtacaatct 
cactccgcta tcgctacgtg actgggtcat 
tgacgcgccc tgacgggctt gtctgctccc 
ctccgggagc tgcatgtgtc agaggttttc 
gcggtaaagc tcatcagcgt ggtcgtgaag 
gtccagctcg ttgagtttct <ocagaagcgt 
gttaagggcg gttttttcct gtttggtcac 
ttcatggggg taatgatacc gat-gaaacga 
gatgaacatg cccggttact ggaacgttgt 
cgggaccaga gaaaaatcac tcagggtcaa 
gttccacagg gtagccagca gcatcctgcg 
gctgacttcc gcgtttccag actttacgaa 
gctcaggtcg cagacgtttt gcagcagcag 
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ttaaaaggat ctaggtgaag atcctttttg 7S80 
agttttcgtt ccactgagcg tcagaccccg 8040 
cttttttt-ct gcgcgtaatc tgctgcttgc 6lO0 
tttgtttgcc ggatcaagag ctaccaactc 8160 
cgcagatacc aaatactgtc <:ttctagtgt 8220 
ctgtagcacc gcctacatac ctcgctotgc 8280 
gcgataagtc gtgtcttacc gggttggact 5340 
ggfccgggctg aacggggggt t-cgtgcacac 8400 
aactgagata cctacagcgt gagctatgag 84 60 
cggacaggta frccggtaagc ggcagggtcg 8520 
ggggaaacgc ctggtatctt tatagtcctg 8580 
gatttttgtg atgctcgtca ggggggcgga 8640 
ttttacggtt cctggccttt tgctggcctt 8700 
ctgattctgt ggataaccgt attaccgcct 87£0. 
gaacgaccga gcgcagcgag tcagtgagcg 8820 
ttctccttac gcatctgtgc ggtatttcac 8880 
gctctgatgc cgcatagtta agccagtata 8940 
ggctgcgccc cgacacccgc caacacccgc 9000 
ggcatccgct tacagacaag ctgtgaccgt 9060 
accgtcatca ccgaaacgcg cgaggcagct 9120 
cgattcacag atgtctgcct gttca-tccgc 9180 
taatgtctgg cttctgataa agcgggccat 9.240 
ttgatgcctc cgtgtaaggg ggaatttctg 9300 
gagaggatgc t-cacgatacg ggttactgat 9360 
gagggtaaac aactggcggt atggatgcgg 9420 
tgccagcgct tcgttaatac agatgtaggt 9480 
atgcagatcc ggaacataat ggtgcagggc 9540 
acacggaaac cgaagaccat tcatgttgtt 9600 
tcgctbcacg <ttcgct<:gcg tatcggtgat 9660 



RMSrmtrV <WO 0121772A3 |A> 
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9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10212 



tcattctgct aaccagtaag 
acgatcatgc gcacccgtgg 
ctggagatgg cggacgcgat 
ctccgcaaga attgattggc 
cggcttccat tcaggtcgag 
agacaaggta tagggcggcg 
cggcataaat cgccgtgacg 
egagcgatcc ttgaagctgt 
gcaacgcggg catcccgatg 
agcctcgcgt eg 



gcaaccccgc cagcctagcc 
ccaggaccca acgctgcccg 
ggatatgttc tgccaagggt 
tccaattctt ggagtggtga 
gtggcccggc tccatgcacc 
cctacaatcc atgccaaccc 
ateageggtc cagtgatcga 
ccctgatggt cgtcatctac 
ccgccggaag cgagaagaat 



gggtcctcaa cgacaggagc 
agatgegecg cgtgcggctg 
tggtttgcgc attcacagtt 
ateegttage gaggtgeege 
gcgacgcaac 'geggggagge 
gttccatgtg ctcgccgagg 
agttaggctg gtaagagccg 
ctgcctggac ageatggect 
cataatgggg aaggccatcc 



<210> 82 
<211> 10426 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN267 plasmid 

<400> 82 

aacaaaattc tccagtcttc acatcggttt gaaaggagga ageggaagaa tigaagtaaga 60 

gggatttttg actccgaagt aagtcttcaa aaaatcaaat aaggagtgtc aagaatgttt 120 

gcaaaacgat tcaaaacctc tttactgccg ttattcgctg gatttttatt gctgtttcat ,180 

ttggttctgg caggaccggc ggctgcgagt getgaaaegg cgaacaaatc gaatgagctt 240 

acagcaccgt cgatcaaaag eggaaccatt ettcatgeat ggaattggtc gttcaatacg 300 

ttaaaacaca atatgaagga tattcatgat gcaggatata cagccattca gacatctccg 360 

attaaccaag taaaggaagg gaatcaagga gataaaagca tgtcgaactg gtactggctg 420 

tatcagcega categtatea aattggcaac cgttacttag gtactgaaca agaatttaaa 480 

gaaatgtgtg cagccgctga agaatatggc ataaaggtca ttgttgacgc ggtcatcaat 540 

cataccacca gtgattatgc cgcgatttcc aatgaggtta agagtattcc aaactggaca 600 

catggaaaca cacaaattaa aaactggtct gatcgaaata gtacataatg gatttcctta 660 

cgegaaatae gggcagacat ggcctgcccg gttattatta tttttgacac cagaccaact 720 

ggtaatggta gcgaccggcg ctcaggatcg tctcggtacc aagagtttgt agaaaegcaa 780 

aaaggecate cgtcaggatg gccttctgct taatttgatg cctggcagtt tatggeggge 840 
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gt-cctgcccg ccaccctccg ggccgttgct tcgcaacgtt <^aaatccgct cccggcggat 900 
ttgtcctact caggagagcg ttcaccgaca aacaacagat aaaacgaaag goccagtctt 960 
tcgactgagc ctttcgtttt atttgatgcc tggcagttcc ctactctcgc atggggagac 1020 
cccacactac catcggcgct acggcgtttc actt<:tgagt t<^gcatggg gtcaggtggg 1080 
accaccgcgc tactgccgcc aggcaaattc tgttttatca gaccgcttct . gcgttctgat 1140 
ttaatctgta tcaggctgaa aatcttctct catocgccaa aacaggatcc at<:acgaagc 1200 
gtcgtatcga aaaaattaat tttgcgcaac ggagaccacc gcttccttct tcttgccttg 1260 
tttcacaaac ggcatcattt cacgaagctc tcttcccact acttcgattt gatgttcgtt 1320 
ctcgcttgca ttgatagcgt tgaaacgagg acggtttact tggttttfcga cgatccactc 1380 
ttttgcgaat gtaccgtttt ggatatcttt taatacttct ttcatagatt cttttacttt 1440 
agcgtccaca acgcgagggc ctgatacgaa atctccccac tgtgctgtgt -cagagattga 1500 
atatctcatt cctgcaagtc cttcttcgta cataaggtct acgattaatt tcagctcatg 1560 
aagacactcg aagtatgcaa gttcaggctg ataacctgct tcagttaagg tttcaaat-cc 1620 
ggctttgaca agcgcgctta atccgccgca aagaactgct tgctcaccga acaaatctgt 1680 
ttctgtttct tctttaaatg tcgtttctaa taogcccgct cttgcgccgc cgattccttt 1740 
agcataagcg agggctttgt ctcttgcttc tccagt-caca tcttgataga ttgcgaacaa 1800 
tgcaggtacg ccagctcctt gctcatatgt tcttcttacc aagtgbcccg ggcctttagg 1860 
ggccactaag aatacatcta catccgccgg aggaacaatt .tgatggaaat gcacgttaaa 1920 
tccatgagcg aatactaatg attttcctgc tgtcaattea tctttgattt cagcttcgta 1980 
tactttttgc tgctgctcat ccggaagcag aaccatgatg atttcggctt «gggctgccgc 2040 
ttcttttact gaaaatactt tatgtccgtc ttcttgggct tgagtgaaag atttt<:cttg 2100 
tctaacaccg acgatcacgt ctactccgct ttctttaagg ttcagggcat gtgcgtggcc 2160 
ttgcgaaccg tacccgataa ccgctactgt ttttccagcc aatacgttct ctttgatatc 2220 
accgttataa tatactttta ccatttcaat ctctcccttg ttatgtttta -tacaatagat 2280 
attgttttat tggatgacgc cttttgctgg ttcccctcgc aaaagccgtt gtacctgttc 2340 
tcgcgatttc tttaatgcca taaggtttta ataactcaat aagcgcttca attttgttag 2400 
attcacctgt cacctgaaca acgatgctgt ctctgctgac atcaacgaca gaggctx:taa 2460 
acggttctat gattccatta atctctgttc ttgttgaagg tgcggagaca aocttgatta 2520 
aggccagctc cctctggaca at-cgattgat ttgtgatgto tgtgact*tc agcacatcaa 2S80 



o*#onnr:»fy «wn ni?i772A3 IA> 
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tctgtttgtt gagctgtttc gttaactgtt 
cgaaggtgat tctggaaacg ccggctgttt 
tgtaatgcct ttttgtgaat agaccggtga 
cagtcaatgt gataattctt ttcaaggttt 
cggagccacc atcgggaata ctttttcttc 
ttctcttgat gttaatgcct cttccagctt 
gatgccttta atgccgtatg cttcggacaa 
agattctgaa taacgttctt catagaaaat 
agcgttattt aaaatcacta ccttgaccgg 
ttgaagcgtc atttggaatc cgccgtctcc 
ggccagctgt gcgccgatcg ccgccggaag 
cgtgacccat ttatctgctt tttggaacgg 
gcctacatcc gttgcgacaa tggcctctcc 
tttctgaggt ttaaaacctt cttcttcatt 
ccattctgcg agctgttttt tccattcgct 
cagctcctgc aggacaattt tgctgtctcc 
tccgatttca gctggatcaa tatcgatgtg 
caggtttcct gtgacacggt catcaaaacg 
atgaagggcc atattggctg tataagtacc 
atggtcagcc gggaagcctc cgagccccaa 
ttcagcataa ttttttaatt cttctgacgc 
gatcaccggt tttttcgcac tgctcacggc 
cggctctgtt gtcggctggt atcccgggag 
ttcaattgtt gctacatctt tcggaatatc 
tgcaatatgg aacgcttctt taatgatgcg 
gctgtgtttt gttactggca tcgtaatccc 
cccgattaca gaggttgcta cctgccctgt 
ggcatcagca aggcctgtaa caaggtttgt 
ccggtttccg gagacccttg cgtatccctc 



- 121 - 

caacatcatt ttcaccttca acatgaacga 2640 
ctgtgtgtcc aactgtaatg ctttcaatgt 2700 
tccggtttaa caccccggag cggttcacca 2760 
cacccccacc atttcatgca gccctttccc 2820 
gctggcaacc cgcacgtcaa tgacaacagg 2880 
ttcctttgct tccgcttctg atgaaattct 2940 
tttgacgaag tcaggctgag aagcgaattt 3000 
ttcctgccac tgtctgacca ttccgagaca 3060 
aagatttaat tcgcgaataa catcgagttc 3120 
gacaaccgcg acaacagtag catctttttc 3180 
accgaatccc atcgttccaa gtccgcctga 3240 
ataaaattgc gctgaccaca tttgatgctg 3300 
ttttgtaaat tgatgaatat attcaat-caa 3360 
atcracatac cagagcgget actcttcttt 34 20. 
tgaatcgctt tgtttgccgt cttgtttgat 3480 
gactacagga atctgtgttt tcatgatttt 3540 
ggctatcttt gcgtttctgg caaagtgttt .3600 
ggcgccgata ctgattaata gatcacattc . 3660 
gtgcattccc gccatcccta ggaaaagcgg 3720 
aagggtgtgt gccacaggga tttgctgctg 3780 
ttttccgtgc agtacgcccg cacccgccag 3840 
ttccacaagc ttgcggatct gcaaataatt 3900 
attcatctca tgatcgtagc tgaattctcc 3960 
aatcaataca ggtccgggtc ttccagttgt 4020 
cggcagatct tccggctggc gaacctggta 4080 
taaaatgtct gcttcctgaa atgcatcgct 4140 
aaagacgact aacggcaatg aatcaatcat 4200 
cgctcccggc ctgacgtggc aatgacgaca 4260 
cgctgcatga attgctcctt gttcgtgacg 4320 
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i 

ggaaggatat gtaccaacct gaatgtatag cttatcgtaa atcggaagca cagccccgcc 4360 
cggataaccg aagatcattt <:tactttctc tttttttaat gattcaatca gcattaatog 4440 
tccgctcatc gtctgtgtac attcggcaga tgctgaatcc acctgtacat tagtccccat 4500 
tttatctcct cctctagagc gtcctgctgt tgttaagatt attataccac acctt^taga 4560 
taaagtcaac aactttttgc aaaatttttc aggaatttta gcagaggttg 1 ttctggatgt 4620 
agaacaaaac atctttccgc tcttgtgctg t*aggatat<: tttcttggaa gctaggtagg 4680 
cctcgagtta tggcagttgg ttaaaaggaa aoaaaaagac cgttttcaca caaaacggtc 4740 
tttttcgatt tctttttaca gtcacagcca cttttgcaaa aaccggacag cttcatgcct 4800 
tataactgct gtttcggtcg acgaagcggc cgccgtttaa acgaattfcct gcagctggcg 48^0 
aatggcgatt ttcgttcgtg aatacatgtt ataataacta taactaataa cgtaacgtga 4 920 
ctggcaagag atatttttaa aacaatgaat aggtttacac ttactttagt tttatggaaa 4 980 
tgaaagatca tatcatatat aatctagaat aaaattaact aaaataatta ttatctagat 5040 
aaaaaattta gaagccaatg aaatctataa ataaactaaa ttaagtttat ttaattaaca 5-100 
actatggata taaaataggt actaatcaaa atagtgagga ggatatatt.t gaatacatac 5160. 
gaacaaatta ataaagtgaa aaaaatactt <^ggaaacatt taaaaaataa ccttattggt 5220 
acttacatgt ttggatcagg agttgagagt ggactaaaac caaatagtga tcttgacttt 5280-. 
ttagtcgtcg tatctgaacc attgacagat -caaagtaaag aaatacttat acaaaaaat*. 5340, 
agacctattt caaaaaaaat aggagataaa agcaacttac gatatattga attaacaatt 5400 
attattcagc aagaaatggt accgtggaat cat-cctccca aacaagaatt tatttatgga 54-60 
gaatggttac aagagcttta tgaacaagga tacattcctc agaaggaatt aaattcagat 5520 
ttaaccataa tgctttacca agcaaaacga aaaaataaaa gaatatacgg aaattatgac 5580 
ttagaggaat tactacctga tattccattt tctgatgtga gaagagccat tatggattcg 5640 
tcagaggaat taatagataa ttatcaggat gatgaaacca actctatatt aactttatgc 5700 
cgtatgattt taactatgga -cacgggtaaa atcataccaa aagatattgc gggaaa-tgca 5760 
gtggctgaat cttctccatt agaacatagg gagagaattt tgttagcagt tcgtagttat 5620 
cttggagaga atattgaatg gactaatgaa aatgtaaatt taactataaa ctatttaaat 5880 
aacagattaa aaaaattata aaaaaattga aaaaatggtg gaaacacttt tttcaatttt 5940 
tttgttttat tatttaatat ttgggaaata ttcattctaa tt^gtaatca gattttagaa 6000 
aacaataaac ccttgcatag ^gggatcgat at-ccgtttag gctgggcggt gatagctt-ct -6060 
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cgttoaggca gtacgcctct tttcttttcc agacctgagg gaggcggaaa tggtgtgagg 6120 
ttcccgggga aaagccaaat aggcgatcgc gggagtgctt tatttgaaga tcaggctatc 6180 
actgcggtca atagatttca caatgtgatg gctggacagc ctgaggaact ctcgaacccg 6240 
aatggaaaca accagatatt tatgaatcag cgcggctcac atggcgttgt gctggcaaat 6300 
gcaggttcat cctctgtctc tatcaatacg gcaacaaaat tgcctgatgg caggtatgac 6360 
aataaagctg gagcgggttc atttcaagtg aacgatggta aactgacagg cacgatcaat 6420 
gccaggtctg tagctgtgct ttatcctgat gatattgcaa aagcgcctca tgttttcctt 6480 
gagaattaca aaacaggtgt aacacattct ttcaatgatc aactgacgat taccttgcgt 6540 
gcagatgcga atacaacaaa agccgtttat caaatcaata atggaccaga cgacaggcgt 6600 
ttaaggatgg agatcaattc acaatcggaa aaggagatcc aatttggcaa aacatacacc 6660 
atcatgttaa aaggaacgaa cagtgatggt gtaacgagga ccgagaaata cagttttgtt 6720 
aaaagagatc cagcgtcggc caaaaccatc ggctatcaaa atccgaatca ttggagccag 6780. 
gtaaatgctt atatctataa acatgatggg agccgagtaa ttgaattgac cggatcttgg 684 0-. 
cctggaaaac caatgactaa aaatgcagac ggaatttaca cgctgacgct gcctgcggac 6900 
acggatacaa ccaacgcaaa agtgattttt aataatggca gcgcccaagt gcccggtcag 6960 
aatcagcctg gctttgatta cgtgctaaat ggtttatata atgactcggg cttaagcggt 7-020, 
tctcttcccc attgagggca aggctagacg ggacttaccg aaagaaacca tcaatgatgg 7080 
tttctttttt gttcataaat cagacaaaac ttttctcttg caaaagtttg tgaagtgttg , 7140 
cacaatataa atgtgaaata cttcacaaac aaaaagacat caaagagaaa cataccctgc 7200 
aaggatgctg atattgtctg catttgcgcc ggagcaaacc aaaaacctgg tgagacacgc 7260 
cttgaattag tagaaaagaa cttgaagatt ttcaaaggca tcgttagtga agtcatggcg 7320 
agcggatttg acggcatttt cttagtcggg cggcacctcg ctaacggatt caccactcca 7380 
agaattggag ccaatcaatt cttgcggaga actgtgaatg cgcaaaccaa cccttggcag 74 40 
aacatatcca tcgcgtccgc catctccagc agccgcacgc ggcgcatctc gggcagcgtt 7500 
gggtcctggc cacgggtgcg catgatcgtg ctcctgtcgt tgaggacccg gctaggctgg 7560 
cggggttgcc ttactggtta gcagaatgaa tcaccgatac gcgagcgaac gtgaagcgac 7620 
tgctgctgca aaacgtctgc gacctgagca acaacatgaa tggtcttcgg tttccgtgtt 7680 
tcgtaaagtc tggaaacgcg gaagtcagcg ccctgcacca ttatgttccg gatctgcatc 7740 
gcaggatgct gctggctacc ctgtggaaca cctacatctg tattaacgaa gcgctggcat 7800 
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tgaccctgag tgatttttct ct^gtcccgc 
caacgttcca gtaaccgggc atgttcatca 
cgtttcatcg gtatcattac ccccatgaac 
accaaacagg aaaaaaccgc ccttaacatg 
cttctggaga aactcaacga gctggacgcg 
cacgaccacg ctgatgagct ttaccgcagc 
aacctctgac acatgcagct cccggagacg 
agcagacaag cccgtcaggg cgcgtcagcg 
acccagtcac gtagcgatag cggagtgtat 
ttgtactgag agtgcaccat atgcggtgtg 
accgcatcag gcgctcttcc gcttcctcgc 
tgcggcgagc ggtatcagct cactcaaagg 
ataacgcagg aaagaacatg tgegcaaaag 
ccgcgttgct ggcgtttttc cataggctcc 
gctcaagtca gaggtggcga aacccgacag 
gaagctccct cgtgcgctct cctgttccga 
ttctcccttc gggaagcgtg gcgctttctc 
tgtaggtcgt tcgctccaag ctgggctgtg 
gcgccttatc cggtaactat cgtcttgagt 
tggcagcagc cactggtaac aggattagca 
tcttgaagtg gtggcctaac tacggctaca 
tgctgaagcc agttaccttc ggaaaaagag 
ccgctggtag cggtggtttt tttgtttgca 
ctcaagaaga tcctttgatc ttttctacgg 
gttaagggat tttggtcatg agattatcaa 
aaaaatgaag ttttaaatca atctaaagta 
aatgcttaat cagtgaggca cctatctcag 
cctgactccc cgtcgtgtag ataactacga 
ctgcaatgat accgcgagac ccacgctcac 
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■cgcatccata ccgccagttg tttaccctca 7860 
tcagtaaccc gtatcgtgag catcct<:tct 7920 
agaaatcccc cttacacgga ^ggcatcagtg 7980 
gcccgcttta tcagaagcca gacattaacg 8040 
gatgaacagg cagacatctg • tgaatcgctt 81O0 
tgcctcgcgc gtttcggtga tgacggtgaa 8160 
gtcacagctt gtctgtaagc ggatgccggg 8220 
^gtgttggcg ggtgtcgggg cgcagccatg 8280 
actggcttaa ctatgccfcca tcagagcaga 8340 
aaataccgca cagatgcgta aggagaaaat 8400 
tcactgactc gctgcgctcg gtcgttcggc 8460 
cggtaatacg gttatccaca gaatcagggg 8520 
gccagcaaaa ggccaggaac cgtaaaaagg 85.80 . 
gcccccctga cgagcatcac aaaaatcgac 8640 
gactataaag ataccaggcg tttccccctg 8700 
ccctgccgct taccggatac ctgtocgcct 8960 
atagctcacg ^ctgtaggtat ctcagttcgg 8820 
tgcacgaacc ccccgttcag -cccgaccgct 8880 
ccaacccggt aagacacgac ttatcgccac 8940 
gagcgaggta tgtaggcggt gctacagagt 9000 
ctagaaggac agtatttggt atctgcgctc 90S0 
ttggtagctc ttgatccggc aaacaaacca 9120 
agcagcagat tacgcgcaga aaaaaaggat 9180 
ggtctgacgc tcagtggaac gaaaactcac 9240 
aaaggatctt x:acctagatc cttttaaatt 9300 
tatatgagta aacttggtct gacagttacc 9360 
cgatctgtct atttcgttca tccatagttg 9420 
tacgggaggg cttaccatct ggccccagtg 9480 
cggctccaga tttatcagca ataaaccagc 9540 



rtl^TTMl 14*. 
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cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta 9600 

ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg 9660 

, ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 9720 

ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 9780 

i 

gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 9840 

ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 9900 

ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 9960 

gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 10020 

ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 1008 0 

cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 10140 

ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 10200 

aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 10260 

gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 10320 

gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc atgacattaa 1O3S0 

cctataaaaa taggcgtatc acgaggccct ttcgtcttca agaatt 10426 

<210> 83 
<211> 4191 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN263 plasmid 

<400> 83 

ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag cccggatccg 60 
aaagaagagg atatcccggt tttacagaag gcattggatg atccaaaggt gtccatcaga 120 
agacaggctg ttgtgtactt aggaatgatt gaaacacctg atgttcttcc tctattgtat 180 
aaagcacttg aggacaaagc tgtatcagtc agaagaacgg ccggagactg cctgtctgat 240 
atcggcgatc ctcaagccat tcctgctatg atcaagtcat taagcgactc cagcaagctt 300 
gttcgctggc gtgccgccat gttcctgtac gaagtcggcg atgaaagtgc aattgaagct 360 
ttgcgcgctg ccgaagatga ccccgaattt gaggtcagcc ttcaagtcaa aatggcgctt 420 
gaacgtattg agcatggaga agaagcaaaa ggttctgttt ggaaacaaat gacggaaagc 480 
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a^aaaaaaag gcgaataaag ataaaaaagg *gcagatcat gcaccttttt tatgtgaatt 540 
ggtcgaccga aacagcagtt ataaggcat^ aagctgtccg gtttttgcaa aagtggctgt 600 
gactgtaaaa agaaatcgaa aaagaccgtt ttgtgtgaaa acggtctttt tgtttccttt €60 
taaccaactg ccataactcg aggcctacct agctt^caag aaagatat-cc taacagcaca 720 
agagcggaaa gatgttttgt tctacatcca gaacaacctc tgctaaaatt cctgaaaaat 760 
tttgcaaaaa gttgttgact ttatctacaa ggtgtggtat aataatctta acaacagcag 840 
gaegctctag aggaggagac accatggcag aattacgcag taatatgatc acacaaggaa 90,0 
tcgatagagc tccgcaccgc agtttgcttc gtgcagcagg ggtaaaagaa gaggatttcg 960 
gcaagccgtt tattgcggtg tgtaattcat acattgatat cgttcocggt catgttcact 1020 
tgcaggagtt tgggaaaatc gtaaaagaag caatcagaga agcagggggc gttccgtttg 1080 
, aatttaatac cattggggta gatgatggca tcgcaatggg gcatatcggt atgagatatt 1140 
cgctgccaag ccgtgaaatt atcgcagact ctgtggaaac ggttgtatcc gcacactggt 1200 
, ttgacggaat ggtctgtatt ccgaactgcg acaaaatcac accgggaatg cttatggcgg 12-60 
caatgcgcat caacattccg acgatttttg tcagcggcgg accgatggog gcaggaagaa 1320 
caagttacgg gcgaaaaatc tccctttcct cagtattcga aggggtaggc gcctaccaag 1380. 
cagggaaaat caacgaaaac gagcttcaag aactagagca gttcggatgc ccaacgtgcg 14 40 
ggtcttgctc aggcatgttt acggcgaact -caatgaactg tct-gtcagaa gcacttggtc 1500 
ttgctttgcc gggtaatgga accattctgg caacatctcc ggaacgcaaa gagtttgtga 1560 
gaaaatcggc tgcgcaatta atggaaacga ttcgcaaaga tatcaaaccg cgtgatattg 1620 
ttacagtaaa agcgattgat aacgcgtttg cactcgatat ggcgctcgga -ggttctacaa 1680 
ataccgttct tcataccctt gcccttgcaa acgaagccgg cgttgaatac tctttagaac 17 4 0 
gcattaacga agtcgctgag cgcgtgccgc acttggctaa gctggcgcct gcatcggatg 1S00 
tgtttattga agatcttcac gaagcgggcg gcgttt<:agc ggctctgaat -gagctttcga 18-60 
agaaagaagg agcgcttcat ttagatgcgc tgactgttac aggaaaaact cttggagaaa 1920 
ccattgccgg acatgaagta aaggattatg acgtcattca cccgctggat caaccattca 1980 
ctgaaaaggg aggccttgct gttttattcg gtaatctagc txrcggacggc gctatcatta 204 0 
aaacaggcgg cgtacagaat gggattacaa gacacgaagg gccggctgtc gtattcgatt 2100 
ctcaggacga ggcgcttgac ggcattatca accgaaaagt aaaagaaggc gacgttgtca 2160 
tcatcagata cgaagggcca aaaggcggac <:tggcatgcc ggaaatgctg gcgccaacat 2220 
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cccaaatcgt tggaatggga ctcgggccaa 
ccggagcctc ccgtggcctc tcaatcggcc 
cgcttgcctt tgttgaaaac ggagaccata 
atgtacaagt gccagaagaa gagtgggaaa 
cgaaagtgaa aaccggctac ctggcacgtt 
gcggtattat gaaaatctag acccctggcg 
ttcccaacag ttgcgcagcc tgaatggcga 
taatgctctg ccagtgttac aaccaattaa 
tcaaatgaaa ctgcaattta ttcatatcag 
gtttctgtaa tgaaggagaa aactcaccga 
atcggtctgc gattccgact cgtccaacat 
aaataaggtt atcaagtgag aaatcaccat 
aaaggttatg catttctttc cagacttgtt 
aatcactcgc atcaaccaaa ccgttattca 
cgcgatcgct gttaaaagga caattacaaa 
ctgccagcgc atcaacaata ttttcacctg 
ctgttttccc agggatcgca gtggtgagta 
gcttgatggt cggaagaggc ataaattccg 
taacatcatt ggcaacgcta cctttgccat 
tcccatacaa tcaatagatt gtcgcacctg 
acccatataa atcagcatcc atgttggaat 
cccgttgaat atggctcata acaccccttg 
ttgttcatga tgatatattt ttatcttgtg 
gacaagatga tcttcttgag atcgttttgg 
aaaaaccgcc ttgcagggcg gtttttcgaa 
aggtaactgg cttggaggag cgcagtcacc 
ggcgcatgac ttcaagacta aetcctctaa 
cttttgcatg tctttccggg ttggactcaa 
cggactgaac ggggggttcg tgcatacagt 
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aagtggcatt gattacggac ggacgttttt: 2280 
acgtatcacc tgaggccgct gagggcgggc 2340 
ttatcgttga tattgaaaaa cgcatcttgg 2400 
aacgaaaagc gaactggaaa ggttttgaac 2460 
attctaaact tgtgacaagt gccaacaccg 2520 
taatagcgaa gaggcccgca ccgatcgccc 2580 
atgagcttgc gccgtcccgt caagtcagcg 2640 
ccaattctga ttagaaaaac tcatcgagca 2700 
gattatcaat accatatttt tgaaaaagcc 27 60 
ggcagttcca taggatggca agatcctggt 2820 » 
caatacaacc tattaatttc ccctcgtcaa 2880 
gagtgacgac tgaatccggt gagaatggca 2940 
caacaggcca gccattacgc tcgtcatcaa 3000 
ttcgtgattg cgcctgagcg agacgaaata 3060 
caggaatcga atgcaaccgg cgcaggaaca 3120 
aatcaggata ttcttctaat acctggaatg '3180 
accatgcatc atcaggagta cggataaaat 3240 
tcagccagtt tagtctgacc atctcatctg 3*300 
gtttcagaaa caactctggc gcatcgggct 3360 * 
attgcccgac attatcgcga gcccatttat 3420 
ttaatcgcgg cctcgacgag caagacgttt 3480 
tattactgtt tatgtaagca gacagtttta 3540 
caatgtaaca tcagagattt tgagacactc 3600 
tctgcgcgta atctcttgct ctgaaaacga 3660 
ggttctctga gctaccaact ctttgaaccg 3720 
aaaacttgtc ctttcagttt agccttaacc 3780 
atcaattacc agtggctgct gccagtggtg 3840 
gacgatagtt accggataag gcgcagcggt 3900 
ccagcttgga gcgaactgcc tacccggaac 3960 
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tgagtgtcag gegtggaatg agacaaacgc ggccataaca gcggaatgac accggtaaac 4020 
cgaaaggcag gaacaggaga gcgcacgagg gagccgccag gggaaacgcc tggtatcttt 4080 
'atagtcctgt cgggtttcgc caccactgat ttgagcgtca gatttcgtga tgcttgtcag 4140 
gggggcggag cctatggaaa aacggctttg ccgcggccct ctcacttccc t 419T 

<210> 84 
<211> 702 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1)..<699) 

<400> 84 

ttg tta ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 48 
Met Leu Leu Val He Asp Val tSly Asn Thr Asn Thr Val Leu Gly Val 
' 15 10 15 

tat cat gat gga aaa tta gaa tat -cac tgg cgt ata gaa aca age agg 96 
Tyr His Asp Gly Lys Leu «31u Tyr His Trp Arg He Glu Thr Ser Arg 

20 » , , 25 . 30 

cat aaa aca gaa gat gag ttt ggg atg att ttg cge tec tta ttt gat 14 4. 
His Lys Thr Glu Asp -Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 

" cac tec ggg ctt atg ttt gaa cag ata gat ggc att att att teg tea 192 
His Ser Gly Leu Met Phe Glu Gin He Asp Gly He He He Ser Ser 
50 55 60 

gta gtg ccg cca ate atg ttt gcg tta gaa aga atg tgc aca aaa tac 240 
Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met <Cys Thr Lys Tyr 
65 70 75 80 

ttt cat ate gag cct caa att gtt ggt -cca ggt atg aaa acc ggt tta 288 
Phe His He Glu Pro Gin He Val -Gly Pro Gly Met Lys Thr -Gly Leu 
85 90 95 

aat ata aaa tat gac aat ccg aaa *gaa gta ggg gca gac aga ate gta 336 
Asn He Lys Tyr Asp Asn Pro Lys Giu Val Gly Ala Asp Acq He Val 
100 105 110 

aat get gtc get gcg ata cac ttg tac ggc aat cca -tta att gtt -gtc 384 
Asn Ala Val Ala Ala He His Leu Tyr X31y Asn Pro JLeu He Val Val 
115 120 125 

gat ttc gga acc gec aca acg tac tgc tat att gat gaa aac aaa -caa 432 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp <Slu Asn Lys <31n 
130 135 140 

tac atg ggc ggg gcg att gec cct ggg att aca att teg aca gag gcg 480 
Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr «Glu Ala 
145 150 155 160 
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ctt tac teg cgt gca gca aag ctt cct cgt ate gaa ate ace egg ccc 528 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg lie Glu lie Thr Arg Pro 
165 170 175 

gac aat att ate gga aaa aac act gtt age gcg atg caa tct gga att 576 
Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

tta ttt ggc tat gtc ggc caa gtg gaa gga ate gtt aag cga atg aaa 624 
Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 

tgg cag gca aaa cag gac cca agg tea ttg cga cag gag gee tgg cgc 672 
Trp Gin Ala Lys Gin Asp Pro Arg Ser Leu Arg Gin Glu Ala Trp Arg 
210 215 220 

cgc tea ttg cga acg aat cag att gta tag 702 
Arg Ser Leu Arg Thr Asn Gin He Val » 
225 230 



<210> 85 
<211> 233 
<212> PRT 

<213> Bacillus subtilis 
<400> 85 

Met Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly v a l 
1 5 10 15 

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 
20 25 - 30 

His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin He Asp Gly lie lie lie Ser Ser 
50 55 60 

Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

Phe His lie Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie Val 
100 105 110 

Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 

Tyr Met Gly Gly Ala He Ala Pro Gly He Thr lie Ser Thr Glu Ala 
145 150 155 160 
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Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg lie -Glu lie Thr Arg Pro 
165 170 175 

Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
i 180 185 ISO 

Leu Phe Gly Tyr Val Gly Gin Val -Glu Gly He Val Lys Arg Met Lys 
195 200 205 

Trp Gin Ala Lys Gin Asp Pro Arg Ser Leu Arg Gin Glu Ala Trp Arg 
210 215 220 

Arg Ser Leu Arg Thr Asn Gin He Val ( 
225 230 

<210> 86 
<211> 1623 
<212> DNA 

<213> Bacillus subtilis 

i 

<220> 
' <221> CDS 
<222> (1) . . (1620) 

<400> 86 

' atg tat ttg gca ttc cag gtg caa, aaa ttg atg egg tat ttg acg ctt 48 
Met Tyr Leu Ala Phe Gin Val Gin Lys Leu Met Arg Tyr Leu Thr Leu 
15 10 15 



tac aag ata aag gac ctg aaa tta teg ttg ccc ggc acg aac aaa acg 
Tyr Lys He Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr 
20 . 25 30 



96 



cag caa ttc atg gec caa gca gtc ggc cgt tta act gga aaa ccg gga .14 4 
Gin Gin Phe Met Ala Gin Ala Val Gly Arg Leu Thr Gly Lys Pro Gly 
35 40 45 

gtc gtg tta gtc aca tea gga ccg ggt gee tot aac ttg gca aca ggc 192 
Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly 
50 55 60 

ctg ctg aca gcg aac act gaa gga -gac cct gtc gtt gcg ctt -get gga 240 
Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly 
65 70 75 80 

aac gtg ate cgt gca tat cgt tta aaa egg aca cat caa tct ttg gat 288 
Asn Val He Arg Ala Tyr Arg Leu Lys Arg Thr His Gin Ser Leu Asp 
85 • 90 95 

aat gcg gcg eta ttc cag ccg att aca aaa tac agt gta -gaa gtt oaa 336 
Asn Ala Ala Leu Phe Gin Pro He Thr Lys Tyr "Ser Val Glu Val -Gin 
100 105 HO 

gat gta aaa aat ata ccg gaa get gtt aca aat gca ttt agg ata gcg 384 
Asp Val Lys Asn He Pro Glu Ala Val Thr Asn Ala Phe Arg He Ala 
115 120 125 

tea gca ggg cag get g;gg gee get ttt gtg age ttt ccg caa gat gtt 432 
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Ser Ala Gly Gin Ala Gly Ala Ala Phe Val Ser Phe Pro Gin Asp Val ' 
130 135 140 

gtg aat gaa gtc aca aat acg aaa aac gtg cgt get gtt gca gcg cca 480 
Val Asn Glu Val Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro 

145 150 155 160 

aaa etc ggt cct gca gca gat gat gca ate agt gcg gee ata gca aaa 528 

Lys Leu Gly Pro Ala Ala Asp Asp Ala He Ser Ala Ala lie Ala Lys 

165 170 175 

ate caa aca gca aaa ctt cct gtc gtt ttg gtc ggc atg aaa ggc gga 576 

He Gin Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly 
180 185 ~ 190 

aga ccg gaa gca att aaa gcg gtt cgc aag ctt ttg aaa aag gtt cag 624 

Arg Pro Glu Ala He Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gin 
195 200 205 

ctt cca ttt gtt gaa aca tat caa get gee ggt acc ctt tct aga gat 672 

Leu Pro Phe Val Glu Thr Tyr Gin Ala Ala Gly Thr Leu Ser Arg Asp 
210 215 220 

tta gag gat caa tat ttt ggc cgt ate ggt ttg ttc cgc aac cag cct 720 
Leu Glu Asp Gin Tyr Phe Gly Arg He Gly Leu Phe Arg Asn Gin Pro 

225 230 235 ~ 240 

ggc gat tta ctg eta gag cag gca gat gtt gtt ctg aeg ate ggc tat 768 

Gly Asp Leu Leu Leu Glu <51n Ala Asp Val Val Leu Thr He Gly Tyr 

245 250 255 

gac ccg att gaa tat gat ccg aaa ttc tgg aat ate aat gga gac egg 816 
Asp Pro He Glu Tyr Asp Pro Lys Phe Trp Asn He Asn Gly Asp Arg 
260 265 270 

aca att ate cat tta gac gag att ate get gac att gat cat get tac 864 

Thr He He His Leu Asp Glu He He Ala Asp He Asp His Ala Tyr 
275 280 285 

cag cct gat ctt gaa ttg ate ggt gac att ccg tec aeg ate aat cat 912 

Gin Pro Asp Leu Glu Leu He Gly Asp He Pro Ser Thr He Asn His 
290 295 300 

ate gaa cac gat get gtg aaa gtg gaa ttt gca gag cgt gag cag aaa 960 

He Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gin Lys 

305 310 315 320 

ate ctt tct gat tta aaa caa tat atg eat gaa ggt gag cag gtg cct 1008 

He Leu Ser Asp Leu Lys Gin Tyr Met His Glu Gly Glu Gin Val Pro 

325 330 " 335 

gca gat tgg aaa tea gac aga gcg cac cct ctt gaa ate gtt aaa gag 1056 

Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu He Val Lys Glu 
340 345 350 

ttg cgt aat gca gtc gat gat cat gtt aca gta act tgc gat ate ggt 1104 

Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp He Gly 
355 360 365 
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teg cac tec att tgg atg tea cgt tat ttc cgc ag;c tac gag ccg tta 1152 
Ser His Ser lie Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu 
370 375 380 

i 

aca tta atg ate agt aac ggt atg caa aca etc ggc <gtt gcg ctt cct 
Thr Leu Met He Ser Asn Gly Met Gin Thr Leu Gly Val Ala Leu Pro 

390 395 400 



385 



1200 



1248 



tgg gca ate ggc get tea ttg gtg aaa ccg gga gaa aaa gtg gtt tct 
Trp Ala He Gly Ala Ser Leu Val Lys Pro Gly 'Glu Lys Val Val Ser 
405 410 415 

gtc tct ggt gac ggc ggt ttc tta ttc tea gca atg gaa tta gag aca 
Val Ser <51y Asp <51y Gly Phe Leu Phe Ser Ala Wet Glu Leu Glu Thr 
420 425 430 

gca gtt cga eta aaa gca cca att gta cac att gta tgg aac gac age 
Ala Val Arg Leu Lys Ala Pro He Val His He Val Trp Asn Asp Ser 
435 440 445 

'aca tat gac atg gtg cat ttc cag caa ttg aaa aaa tat aac cgt aca 
Thr Tyr Asp Met Val His Phe -Gin Gin Leu Lys Lys Tyr Asn Arg Thr 
450 455 460 

tct gcg gtc gat ttc gga aat ate , gat ate gtg aaa, tat gcg gaa age 
Ser Ala Val Asp Phe Gly Asn He Asp He Val Lys Tyr Ala Glu Ser 
465 470 475 480 

ttc gga gca act gcg ttg cgc gta gaa tea cca gac cag ctg gca gat 
Phe Gly Ala Thr Ala Leu Arg Val "Glu Ser Pro Asp Gin Leu Ala Asp 
485 490 495 

gtt ctg cgt caa ggc atg aac get gaa ggt cct gtc ate ate gat gtc .1536 
Val Leu Arg Gin Gly Met Asn Ala Glu Gly Pro Val He He Asp Val 
500 505 510 



1296 



1344 



1392 



1440 



1488 



ccg gtt gac tac agt gat aac att aat tta gca agt gac aag ctt «ccg 
Pro Val Asp Tyr Ser Asp Asn He Asn Leu Ala Ser Asp Lys Leu Pro 
515 520 525 

aaa gaa ttc ggg gaa etc atg aaa acg aaa get etc tag 
Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu 
530 " 535 540 



<210> 87 
<211> 540 
<212> PRT 

<213> Bacillus subtilis 
<400> 87 

Met Tyr Leu Ala Phe Gin Val Gin Lys Leu Wet Arg Tyr Leu Thr Leu 
15 10 15 

Tvr Lvs He Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr 
20 25 30 



1584 



1623 
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Gin Gin Phe Met Ala Gin Ala Val Gly Arg 
35 40 



Leu Thr Gly Lys Pro Gly 
45 



Val Val Leu Val Thr Ser Gly Pro Gly Ala 

50 -55 



Ser Asn Leu Ala' Thr Gly 
60 



Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro 
65 70 

Asn Val He Arg Ala Tyr Arg Leu Lys Arg 
85 90 

Asn Ala Ala Leu Phe Gin Pro He Thr Lys 
100 105 

Asp Val Lys Asn He Pro Glu Ala Val Thr 
115 120 

Ser Ala Gly Gin Ala Gly Ala Ala Phe Val 
130 135 

Val Asn Glu Val Thr Asn Thr Lys Asn Val 
145 150 

Lys Leu Gly Pro Ala Ala Asp Asp Ala He 
165 170 

lie Gin Thr Ala Lys Leu Pro Val Val Leu 
180 185 

Arg Pro Glu Ala He Lys Ala Val Arg Lys 
195 200 

Leu Pro Phe Val Glu Thr Tyr Gin Ala Ala 
210 215. 



Val Val Ala Leu Ala Gly 
75 80 

Thr His Gin Ser Leu Asp 
95 

Tyr Ser Val Glu Val Gin 
110 

Asn Ala Phe Arg He Ala 
125 

Ser Phe Pro Gin Asp Val 
140 

Arg Ala Val Ala Ala Pro 
155 160 

Ser Ala Ala He Ala Lys 
175 

Val Gly Met Lys Gly Gly 
190 

Leu Leu Lys Lys Vai Gin 
205 

Gly Thr Leu Ser Arg Asp 
220 



Leu Glu Asp Gin Tyr Phe Gly Arg lie Gly 
225 230 

Gly Asp Leu Leu Leu Glu Gin Ala Asp Val 
245 ~ 250 

Asp Pro He Glu Tyr Asp Pro Lys Phe Trp 
260 265 

Thr He He His Leu Asp Glu He He Ala 
275 280 



Leu Phe Arg Asn Gin Pro 
235 240 

Val Leu Thr lie Gly Tyr 
255 

Asn lie Asn Gly Asp Arg 
270 

Asp lie Asp His Ala Tyr 
285 



Gin Pro Asp Leu <31u Leu lie Gly Asp He 
290 295 

He Glu His Asp Ala Val Lys Val Glu Phe 
305 310 

He Leu Ser Asp Leu Lys Gin Tyr Met His 

325 330 



Pro Ser Thr He Asn His 
300 

Ala Glu Arg Glu Gin Lys 
315 320 

Glu Gly Glu Gin Val Pro 
335 



Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu He Val Lys Glu 
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340 345 ' 350 

Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp lie -Gly 
355 360 365 

i 

Ser His Ser lie Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu 
370 375 380 

Thr Leu Met He Ser Asn Gly Met Gin Thr Leu Gly Val Ala Leu Pro 
385 390 395 4O0 

Trp Ala He Gly Ala Ser Leu Val Lys Pro Gly "Glu Lys Val Val Ser 
405 410 415 

Val Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu Thr 
420 425 430 

Ala Val Arg Leu Lys Ala Pro He Val His He Val Trp Asn Asp ^Ser 
435 440 445 

Thr Tyr Asp Met Val His Phe Gin Gin Leu Lys Lys Tyr Asn Arg Thr 
450 * 455 460 

Ser Ala Val Asp Phe Gly Asn He Asp He Val Lys Tyr Ala -Glu Ser 
4 65 470 475 480 

Phe Gly Ala Thr Ala Leu Arg Val -Glu Ser Pro Asp Gin Leu Ala Asp . 

485 490 495 

Val Leu Arg Gin Gly Met Asn Ala -Glu Gly Pro Val He lie Asp Val 
500 ' 505 510 

Pro Val Asp Tyr Ser Asp Asn He Asn Leu Ala -Ser Asp Lys Leu Pro 
515 520 525 

Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu 
530 " 535 540 



<210> 88 
<211> 23 
<212> DNA 

<213> Artificial Sequence* 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<220> 

<223> All occurrences of n indicate any nucleotide 
<400> 88 

agaaaggagg tgannnnnnn atg 



<210> 89 
<211> 7 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 1 

<400> 89 

He Arg Glu Met Glu Arg He • 
1 5 



<210> 90 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

<400> 90 

He Arg Glu Arg Arg 
1 5 



<210> 91 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

<400> 91 

He Arg Arg Lys Glu Val Asn 
1 5 

<210> 92 
<211> 6688 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN336 plasmid 

<400> 92 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgcacc aggcttctca 240 
ggcgctgact tagaaaacct cttgaatgaa gctgcgcttg tagcggctcg tcaaaacaag 300 
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aaaaaaatcg atgcgcgtga tattgacgaa gcgacggacc gtgtaattgc cggacccgct 360 
aagaagagcc gcgttatctc caagaaagaa cgcaatatcg tggcttatca cgaaggcgga 420 
cacaccgtta tcggtctcgt tttagatgag gcagatatgg ttcataaagt aacgattgtt 480 
cctcggggcc aggctggcgg ttatgctgtt atgctgccaa gagaagaccg ttatttccaa 54 0 
acaaagccgg agctgcttga taaaattgtc ggcctcttgg gcggacgtgt tgctgaagag 600 
attatcttcg gtgaagtcag cacaggggcg cacaatgact tccagcgtgc gacgaatatt *660 
gcaagacgaa tggttacaga attcggtatg tcagaaaaac tgggaccgtt gcaatttgga 720 
cagtctcagg gcggtcaggt attcttaggc cgtgatttca acaacgaaca gaactacagt 780 
gatcaaatcg cttacgaaat tgatcaggaa att-cagcgca toatcaaaga atgttatgag 840 
cgtgcgaaac aaatcctgac tgaaaatcgt gacaagcttg aattgattgc ccaaacgctt 900 
ctgaaagttg aaacgcttga cgctgaacaa atcaaacacc ttatcgat-ca tggaacatta 960 
cctgagcgta atttctcaga tgatgaaaag aacgatgatg tgaaagtaaa cattctgaca 1020 
aaaacagaag aaaagaaaga cgatacgaaa gagtaattcg -ctttctttct aaaaaaactg 1080 
ccggctgacg ctggcagttt ttttatgtaa atgattggct cagctgcggc ttttacaatc 1140 
atccaattct ggtatcgatt tgtttacaaa tgagccgctg atcgtgtatg gtattgtaga 1200 
atgtttgtaa. aaagtaaagt agagaaacta ttcaaaagtg ^tgatagagg ttgttactgg, 1260 
ttatcgatgt ggggaacacc ctgcagctcg agtgaaatac cgcacagatg cgtaaggaga 1320 
aaataccgca tcaggcgata aacccagcga accatttgag gtgataggta ag at tat acc. 1380 
gaggtatgaa aacgagaatt ggacctttac agaattactc tatgaagcgc catatttaaa 14 40 
aagctaccaa gacgaagagg atgaagagga tgaggaggca gattgccttg aatatattga 1500 
caatactgat aagataatat atcttttata tagaagatat -cgccgtatgt aaggatttca 1560 
gggggcaagg cataggcagc gcgcttatea atatatctat agaatgggca aagcataaaa 1620 
acttgcatgg actaatgctt gaaacccagg acaataacct tatagcttgt aaattctatc 1680 
ataattgtgg tttcaaaatc ggctccgtcg atactatgtt atacgccaac tttcaaaaca 1740 
actttgaaaa agctgttttc tggtatttaa ggttttagaa tgcaaggaac agtgaattgg 1800 
agttcgtctt gttataatta gcttcttggg gtatctttaa atactgtaga aaagaggaag 1860 
gaaataataa atggctaaaa tgagaatatc accggaattg aaaaaactga tcgaaaaata 1920 
ccgctgcgta aaagatacgg aaggaatgtc tcctgctaag gtatataagc tggtgggaga 1980 
aaatgaaaac ctatatttaa aaatgacgga cagccggtat aaagggacca cctatgatgt 2040 
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ggaacgggaa 


aaggacatga 


tgctatggct 


gcactttgaa 


cggcatgatg 


gctggagcaa 


ttgctcggaa 


gagtatgaag 


atgaacaaag 


gtgcatcagg 


ctctttcact 


ccatcgacat 


cagccgctta 


gccgaattgg 


attacttact 


aaactgggaa 


gaagacactc 


catttaaaga 


ggaaaagccc 


gaagaggaac 


ttgtcttttc 


tgtgaaagat 


ggcaaagtaa 


gtggctttat 


gtggtatgac 


attgccttct 


gcgtccggtc 


tgtcgagcta 


ttttttgact 


tactggggat 


tattttactg 


gatgaattgt 


tttagtacct 


aagcttttta 


gacatctaat 


cttttctgaa 


tttatatctt 


ttctaaaagt 


tcgctagata 


tcgccattcg 


ccattcaggc 


tgcgcaactg 


caggcaaaac 


aggacccaag 


gtcattgcga 


aatcagattg 


tatagacatc 


gttgatccat 


atgaaagaaa 


ccgcgtagga 


agtgtatagg 


acttgcgtat 


gacggaaaag 


ttcgggctta 


ggggcagaga 


cgccatggta 


cgtggccgac 


agcttcactt 


atgctcggcg 


ctatgctgaa 


gggcggaggt 


ccgatcggag 


ctattgtagc 


ctatgtctct 


aacccgcaag 


ttcattttga 


acgtgcggtt 


ggaacaaacg 


gaacgttaag 


cttcacagga 


caagtagaaa 


tcgtttcagg 


tgtgtcatct 


gagcaggttc 


cttcatcagt 


taccattctt 


gcggcagggg 


gctttattat 


aatcacaaaa 


attgaacagc 


gtctatctca 


agggctgaca 


ccagaagaaa 


ttttagaaga 


aacgatgcct 


gtcagattcc 


attgcccttg 
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ggaaggaaag ctgcctgttc caaaggtcct 2100 
tctgctcatg agtgaggccg atggcgtcct 2160 
ccctgaaaag attatcgagc tgtatgcgga 2220 
atcggattgt ccctatacga atagcttaga 2280 ' 
gaataacgat ctggccgatg ■ tggattgcga 2340 
tccgcgcgag ctgtatgatt ttttaaagac 2400, 
ccacggcgac ctgggagaca gcaacatctt 2460 
tgatcttggg agaagcggca gggcggacaa 2520 
gatcagggag gatatcgggg aagaacagta 2580 
caagcctgat tgggagaaaa taaaatatta 2640 
agatttagat gtctaaaaag ctttaactac 2700 
gtacatccgc aactgtccat actctgatgt. 2760 
ggggtcccga gcgcctacga ggaatttgta 2820 
ttgggaaggg cgatcggtgc ggtcgactgg : 2880 
caggaggcct ggcgccgctc attgcgaacg 2940 
tcttaaccct aaaagggctg gaattgattt- 3000 
aggtttagta atggattatt tagtaaaagc 3060 
tgcagcgaga acgactgata tggtaaatga 3120 
agcatccgct gcactaggcc gtacaatgac 3180 
gggcgatgat aagctgaccg tgaaaatcga 324 0 
tgatgccaat gccaaaggag aagtcagagc 3300 
tttaaatgaa caaggtaagc ttgatgtcag 3360 
tgtcgtaaaa gatttaggtt tgcgcgagtt 3420 
agaattagga gatgatttta cttactatct 3480 
gggcgtaggt gtgctcgtaa atcctgacaa 3540 
tcagctgatg ccgggaacag atgatgaaac 3600 
agtagagccg atttctaagc tcatccaaaa 3660 
agtcctaggc gagaaacctg agattttgga 3720 
ttcaaaagaa cggttcgaaa cagccatttt 3780 
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aggactaggc aaaaaagaaa ttcaagatat 
atgccatttt tgtaatgaaa agtacttatt 
ccaaactacc cgctaagctc tttagcgggt 
gtttgaaatc aagaacaatc tggacgcgtt 
acctaaatag cttggcgtaa tcatggtcat 
tcacaattcc acacaacata cgagccggaa 
gagtgagcta actcacatta attgcgttgc 
tgtcgtgcca gctgcattaa tgaatcggcc 
ggcgctcttc cgcttcctcg ctcactgact 
cggtatcagc tcactcaaag gcggtaatac 
'gaaagaacat gtgagcaaaa ggccagcaaa 
tggcgttttt -cgataggctc cgcccccctg 
agaggtggcg aaacccgaca ggactataaa 
tcgtgcgctc tcctgttccg accctgccgc 
cgggaagcgt ggcgctttct ^catagctcac 
ttcgctccaa. gctgggctgt gtgcacgaac 
ccggtaacta tcgtcttgag tccaacccgg 
ccactggtaa caggattagc agagcgaggt 
ggtggcctaa ctacggctac actagaagga 
cagttacctt cggaaaaaga gttggtagct 
gcggtggttt ttttgtttgc aagcagcaga 
atcctttgat cttttctacg gggtctgacg 
ttttggtcat gagattatca aaaaggatct 
gttttaaatc aatctaaagt atatatgagt 
tcagtgaggc acctatctca gcgatctgtc 
ccgtcgtgta gataactacg atacgggagg 
taccgcgaga cccacgctca ccggctccag 
gggccgagcg .cagaagtggt cctgcaactt 
gccgggaagc tagagtaagt agttcgccag 
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gatagaagaa gatggacaag ccgaagcagt 3840 
tacaaaagaa gagctggaag ggcttcgtga 3900 
ttttaatttg agaaaagggg ctgaaagcag 3960 
ggatgcatag cttgagtatt ctatagtgtc 4020 
agctgtttcc tgtgtgaaat tgttatccgc 4080 
gcataaagtg taaagcctgg ggtgcctaat 4140 
gctcactgcc cgctttccag tcgggaaacc 4200 
aacgcgcggg gagaggcggt ttgcgtattg 4260 
cgctgcgctc ggtcgttcgg ctgcggcgag 4320 ( 
ggttatccac agaatcaggg gataacgcag 4380 
aggccaggaa t:cgtaaaaag gccgcgttgc 44 40 
acgagcatca caaaaatcga cgctcaagtc 4S00 
<jataccaggc gtttccccct ggaagctccc 4560 . • 

txaccggata -cctgtccgcc tttctccctt' 4620 
gctgtaggta tctcagttcg gtgtaggtcg- 4 680 
cccccgttca gcccgaccgc tgcgccttat- 4740 
taagacacga cttatcgcca ctggcagcag 4800 
atgtaggcgg tgctacagag ttcttgaagt- 4860 
cagtatttgg tatctgcgct ctgctgaa-gc 4920 
cttgatccgg caaacaaacc accgctggta 4 980 
ttacgcgcag aaaaaaagga tctcaagaag 5040 
ctcagtgg^a cgaaaacfrca cgttaaggga 5100 
tcacctagat ccttttaaat taaaaatgaa 51-60 
aaacttggtc tgacagttac <:aatgcttaa 5220 
tatttcgttc atccatagtt gcctgactcc 5280 
gcttaccatc tggccccagt gctgcaatga 5340 
atttatcagc aataaaccag ccagccgcfaa 54O0 
tatccgcctc -catccagtct attaattgtt 5460 
ttaatagttt gcgcaacgtt gttggcattg 5520 
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ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc 5580 
aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg 5640 
gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag 5700 
cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt 5760 
actcaaccaa gtcattctga gaataccgcg cccggcgacc gagttgctct tgcccggcgt 5820 
caatacggga taatagtgta tgacatagca gaactttaaa agtgctcatc attggaaaac 5880 
gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac 5940 
ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag 6000 
caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa 6060 
tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga 6120 
gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc 6180 
1 cccgaaaagt gccacctgta tgcggtgtga aataccgcac agatgcgtaa ggagaaaata 6240 
ccgcatcagg cgaaattgta aacgttaata ttttgttaaa attcgcgtta aatatttgtt' 6300 
aaatcagctc attttttaac caataggccg aaatcggcaa aatcccttat aaatcaaaag' 6360 
aatagaccga gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga 6420 
acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg. 6480 
aaccatcacc caaatcaagt tttttgcggt cgaggtgccg taaagctcta aatcggaacc 65^0 
ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg' 6600 
aagggaagaa agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg gtcacgctgc 6660 
gcgtaaccac cacacccgcc gcgcttaa 6688 

<210> 93 
<211> 8503 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN004 plasmid 

<400> 93 

gaattttgcg gccgcttcga aagctgtaat ataaaaacct tcttcaacta acggggcagg 60 
ttagtgacat tagaaaaccg actgtaaaaa gtacagtcgg cattatctca tattataaaa 120 
gccagtcatt aggcctatct gacaattcct gaatagagtt cataaacaat cctgcatgat 180 
aaccatcaca aacagaatga tgtacctgta aagatagcgg taaatatatt gaattacctt 240 
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tattaatgaa ttttcctgct gtaataatgg gtagaaggta attactatta ttattgatat 300 
ttaagttaaa cccagtaaat gaagtccatg gaataataga aagagaaaaa gcattttcag 360 
gtataggtgt tttgggaaac aatttccccg aaccattata tttctctaca tcagaaaggt 420 
ataaatcata aaactctttg aagtcattct ttacaggagt x:caaatacca gagaatgttt 480 
tagatacacc atcaaaaatt gtataaagtg gctctaactt atcccaataa cctaactctc 540 
cgtcgctatt gtaaccagtt ctaaaagctg tatttgagtt tatcaccctt gtcactaaga 600 
aaataaatgc agggtaaaat ttatatcctt cttgttttat gtttcggtat aaaacactaa 660 
tatcaatttc tgtggttata ctaaaagtcg tttgttggtt caaataatga ttaaatatct 720 
cttttctctt ccaattgtct aaatcaattt tattaaagtt -catttgatat gcctcctaaa 780 
tttttatcta aagtgaattt aggaggctta cttgtctgct -ttcttcatta gaatcaatcc 840 
ttttttaaaa gtcaatatta ctgtaacata aatatatatt ttaaaaatat cccactttat 900 
ccaattttcg tttgttgaac taatgggtgc tttagttgaa gaataaagac cacattaaaa 960 
aatgtggtct tttgtgtttt tjttaaaggat ttgagcgtag cgaaaaatcc ttttctttct 1020 
tatcttgata ataagggtaa ctattgaatt cggtaccaag agtttgtaga aacgcaaaaa 1080 
ggccatccgt caggatggcc ttctgcttaa tttgatgcct ggcagtttat ggcgggcgtc 1140 
ctgcccgcca ccctccgggc cgttgcttcg caacgttcaa atccgctccc ggcggatttg; 1200 
tcctactcag gagagcgttc accgacaaac aacagataaa aogaaaggcc cagtctttcg* 1260 
actgagcctt tcgttttatt tgatgcctgg cagttcccta .ctctcgcatg gggagacccc 1320 
acactaccat cggcgctacg gcgtttcact £ctgagtt<cg -gcatggggtc aggtgggacc 1380 
accgcgctac tgccgccagg <caaattctgt tttatcagac cgctt^tgcg ttctgattta 1440 
atctgtatca ggctgaaaat cttctctcat ccgccaaaac aggatcctac ggaaatggag 1500 
cggcaaaacc gttttactct caaaatctta aaagaaaacc cccgataaag ggggcttttc 1560 
ttctacaaaa ttgtacgggc tggttcgttc cccagcattt gttcaatttt gttttgatca 1620 
ttcagaacag ccactttcgg cfccatggctt gccgcttctt gatcagacat <:attttgtag 1680 
gaaataataa tgaccttatc tccttcctgc acaaggcgtg cggctgcacc gtttaagcat 1740 
atgacgccgc ttccccgttt accaggaata atatacgttt -caagacgtgc *ccattatta 1800 
ttattcacaa tttgtacttt ttcattagga agcattccca cagcatcaat gagatctt-ca 18-60 
tcaattgtaa tgcttcccac atagttcagg tttgcttccg taacagttgc cctgtgaagt 1920 
ttgccgctca tcattgttcg atacatatta tatt-ct<:tcc atttctcgaa tatcaataat 1980 
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gatattatct attaaacgcg cttttgaaaa 
agcaatttca ttcacaggct cgagttccgg 
' tccgctagtc gtttcaatga tatcttttgc 
accggcttgg acaagttccg cacttgtttg 
tctttcctca gctgttaagt atacattgcg 
gaccgtatcg acaggaacca attcaatatc 
aacagctacc tgctgcgcat cttttaaacc 
gaaaagcttc gtcagtacga tcgcgacccc 
taacacgtct gtgcgtcttt ctacatgaat 
atgagcatct ggcgtaaaaa gaatatcgac 
ctcaatatcg cgcggatatg cttcaaaatc 
aaaaatactc ataataacgg cgtcgttttc 
cccctcatgc agaaacccca tcgtcggaac 
ttgttttatg gcttctttca gctgtgaaat 
cgtacaagcc gtcaagcact gtctggttca 
cacgatgtct tacatcctga acatatccgc 
gcgtatattg ctttacaaat ttaggtgttc 
aaacgagaac ttgtccgtcc gctttcacac 
gcgtctcggc aattttggct gtgagttctg 
ctcctgcttc ttcgcatttt atactgtctt 
tgccctgtac tttatagccg cccagtacgc 
tgactactgg aatgcctcca agcgtcaatg 
cctcaagctt cagtgcgtca gctccgcttt 
tatcttcctt agacaggtga taagacataa 
gcgcacccct tttaacggct tttgtatgat 
ttgaatcaag gccgaggacg accattccaa 
ccgcttgttc agcaagttta gctgccggat 
cttcagactc cttcattttt agaaaatcca 
gagcgtcctg ctgttgttaa gattattata 
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agcaactgca 


acagcgagaa 


tcatctttcc 2040 


ataggaataa 


agctctacat 


agtctatggt 2100 


agcttttatc 


accgcttcag 


gatctctttc 2160 


aagggcccga 


tacagcttag 


gcgcttcttt 2220 


agagcttttg 


gctaagccgt 


cttcctctct 2280 


catgaagaag 


tcgctgatta 


acccatcaac 2340 


gaaataggca 


cgagtcggct 


tgactagatt 2400 


gtcaaaatgt 


ccttctcttg 


agcgcccgca 24 60 


cgtgacattc 


ttttcaccgg 


gatacatatc 2520 


tccggcgttt 


tctgcaagag 


ctgcatcccg 2580 


ttcattaggg 


i 

ccgaattgtg 


caggattcac 2640 


ttgtcttgct 


ttgtctgcta 


aggttaaatg 27O0 


aaatccgatt 


gacttgccct 


ctgaatggta. 2760 


atcagtaatc 


tgtctcatct 


tattttcccc 2820 


tttgaaagga 


atgcttttgt 


tcagggaaag 2880 


tgattgctgt 


ttcgatggtt 


tcatcaatgc 294 0 


tctcaacacc 


gtggccgata 


atatcatgat: 3000 


cagccccgat 


tccaatgacc 


ggtatgctta .3060 


ccggcacaca 


ttccagcaca 


agcatcatag 3120 


ctattaattt 


tttggcgctt 


tgttcgtctt 3180 


cgactgactg 


cggtgtcaaa 


cctaagtgac 3240 


cgcgaatgga 


ttcaaacacg 


ccttctccgc 3300 


cctgaacgat 


agccgctgca 


tttttcagcg 3360 


acggcatatc 


tgtcacaata 


aaggtattcg 3420 


ggatcatgtc 


cgcaactgtc 


acaccgacag 34 80 


gtgaatcacc 


gactaaaatc 


atgtcaactc 3540 


aatcataagc 


ggtcagcatg 


acaatcggtt 3600 


gttttgtttt 


catgttttct 


cctccttcta 3660 


ccacaccttg 


tagataaagt 


caacaacttt 3720 
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ttgcaaaatt tttcaggaat tttagcagag 
ccgctcttgt gctgttagga tatctttctt 
ttggttaaaa ggaaacaaaa agaccgtttt 
tacagtcaca gccacttttg caaaaaccgg 
gtcgacaagc tbcgcgaagc ggccgcaaaa 
ctgggaaaac cctggcgtta cccaacttaa 
ctggcgtaat agcgaagagg cccgcaccga 
tggcgaatgg cgcctgatgc ggtattttct 
catatggtgc actctcagta caatctgctc 
cccgccaaca cccgctgact atgcttgtaa 
aaaggggacc tctagggtcc ccaattaatt 
aggtcatcca ccggatcagc ttagtaaagc 
ttacttcgcc aactattgcg afcaacaagaa 
tttgtgtagg gcttattatg cacgcttaaa 
tggctgtgag caattatgtg cttagtgcat 
cggcgtcggc ttgaacgaat tgttagacat 
ttcacgtagt ggacaaattc ttccaactga 
tgtccaagat aagcctgtct agcttcaagt 
tccattgccc agtcggcagc gacatccttc 
caaatgcggg acaacgtaag cactacattt 
ttccatagcg ttaaggtttc atttagcgcc 
aagagttcct ccgccgctgg acctaccaag 
aagatagcca gatcaatgtc gatcgtggct 
cgctgccatt ctccaaattg cagttcgcgc 
tcgtgcacaa caatggtgac ttctacagcg 
gaagtttcca aaaggtcgtt gat-caaagct 
accgtaacca gcaaatcaat atcactgtgt 
tacaaatgta cggccagcaa -cgtcggttcg 
gatagttgag tcgatacttc -ggcgatcacc 
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gttgttctgg atgtagaaca aaacatcttt 3780 
<ggaagctagg taggcctcga gttatggcag 3840 
cacacaaaac ggtctttttc gatttctttt 3900 
acagcttcat gccttataac tgctgtttcg 3960 
ttcactggcc gtcgttttac • aacgtcgtga 4020 
tcgccttgca gcacatcccc ctttcgccag 4080 
tcgcccttcc caacagttgc gcagcctgaa 4140 
ccttacgcat ctgtgcggta tttcacaccg 4200 
tgatgccgca tagttaa^cc agccccgaca 42*60 
accgttttgt gaaaaaattt ttaaaataaa 4320 
agtaatataa totattaaag gtcattcaaa 4380 
cctcgctaga ttttaatgcg gatgttgcga- 44 40 
aaagccagcc tttcatgata tatctcocaa 4500 , 
aataataaaa gcagacttga cctgatagtt" 45"60 
ctaacgcttg agttaagccg cgccgcgaag 4 620 
tatttgccga ctaccttggt gatctcgcct 4680 
tctg^cgcgcg aggccaagcg atcttcttot* 4740 
atgacgggct gatactgggc cggcaggcgc 4800 
ggcgcgattt tgccggttac tgcgctgtac 4860 
<:gctcatcgc cagcccagtc gggcggcgag 4920 
tcaaatagat cctgttcagg aaccggat-ca 4 980 
gcaacgctat gttctcttgc ttttgbcagc 5040 
ggctcgaaga tacctgcaag aatgtcattg 5100 
ttagctggat aacgccacgg aatgat-gbcg 5160 
cggagaatct cgctctctcc aggggaagcc 5220 
cgccgcgttg tttcatcaag -ccttacggtc 5280 
ggcttcaggc cgccatccac tgcggagccg 5340 
agatggcgct cgatgacgcc aactacctct 54O0 
gcttcccfcca tgatgtttaa ctttgtttta 5460 
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gggcgactgc cctgctgcgt aacatcgttg ctgctccata acatcaaaca tcgacccacg 5520 
gcgtaacgcg cttgctgctt ggatgcccga ggcatagact gtaccccaaa aaaacagtca 5580 
» taacaagcca tgaaaaccgc cactgcgccg ttaccaccgc tgcgttcggt caaggttctg 5640 
gaccagttgc gtgagcgcat acgctacttg cattacagct tacgaaccga acaggcttat 5700 
gtccactggg ttcgtgcctt catccgtttc cacggtgtgc gtcacccggc aaccttgggc 57 60 
agcagcgaag tcgaggcatt tctgtcctgg ctggcgaacg agcgcaaggt ttcggtctcc 5820 
acgcatcgtc aggcattggc ggccttgctg ttcttctacg gcaaggtgct gtgcacggat 58 80 
ctgccctggc ttcaggagat cggaagacct cggccgtcgc ggcgcttgcc ggtggtgctg 5940 
accccggatg aagtggttcg catcctcggt tttctggaag gcgagcatcg tttgttcgcc 6000 
cagcttctgt atggaacggg catgcggatc agtgagggtt tgcaactgcg ggtcaaggat 6060 
ctggatttcg atcacggcac gatcatcgtg cgggagggca agggctccaa ggatcgggcc 6120 
ttgatgttac ccgagagctt ggcacccagc ctgcgcgagc aggggaattg atccggtgga. 6180 
tgaccttttg aatgaccttt aatagattat attactaatt aattggggac cctagaggtc, 624 0 
ccctttttta ttttaaaaat tttttcacaa aacggtttac aagcataacg ggttttgctg ' 6300 
cccgcaaacg ggctgttctg gtgttgctag tttgttatca gaatcgcaga tccggcttca 6360 
ggtttgccgg ctgaaagcgc tatttcttcc agaattgcca tgattttttc cccacgggag,* 6420 
gcgtcactgg ctcccgtgtt gtcggcagct ttgattcgat aagcagcatc gcctgtttca* -6480 
ggctgtctat gtgtgactgt tgagctgtaa caagttgtct caggtgttca att t cat gt t . 6540 
ctagttgctt tgttttactg gtttcacctg ttctattagg tgttacatgc tgttcatctg 6600 
ttacattgtc gatctgttca tggtgaacag ctttaaatgc accaaaaact cgtaaaagct 6660 
ctgatgtatc tatctttttt acaccgtttt catctgtgca tat-ggacagt tttccctttg 6720 
atatctaacg gtgaacagtt gttctacttt tgtttgttag tcttgatgct tcactgatag 6780 
atacaagagc cataagaacc tcagatcctt ccgtatttag ccagtatgtt ctctagtgtg 6840 
gttcgttgtt tttgcgtgag ccatgagaac gaaccattga gatcatgctt actttgcatg 6900 
tcactcaaaa attttgcctc aaaactggtg agctgaattt ttgcagttaa agcatcgtgt 6960 
agtgtttttc ttagtccgtt acgtaggtag gaatctgatg taatggttgt tggtattttg 7020 
tcaccattca tttttatctg gttgttctca agttcggtta cgagatccat ttgtctatct 7080 
agttcaactt ggaaaatcaa cgtatcagtc gggcggcctc gcttatcaac caccaatttc 7140 
atattgctgt aagtgtttaa atctttactt attggtttca aaacccattg gttaagcctt 7200 
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ttaaactcat ggtagttatt ttcaagcatt aacatgaact taaattcatc aaggctaatc 72*0 
tctatatttg ccttgtgagt tttcttttgt -gttagttctt ttaataacca ctcataaatc 7320 
ctcatagagt atttgttttc aaaagactta acatgttcca gattatattt tatgaatttt 7380 
tttaactgga aaagataagg caatatctct tcactaaaaa ctaattctaa tttttcgctt 74 40 
gagaacttgg catagtttgt ccactggaaa atctcaaagc ctttaaccaa aggattcctg 7S00 
atttccacag ttctcgtcat cagctctctg gfctgctttag ctaatacacc ataagcattt 7560 
tccctactga tgttcatcat ctgagcgtat tggttataag tgaacgatac cgtccgttct 1€20 
ttccttgtag ggttttcaat cgtggggttg agtagtgcca cacagcataa aattagcttg 7S80 
gtttcatgct ccgttaagtc atagcgacta atcgctagtt catttgcttt gaaaacaact 7740 
aattcagaca tacatctcaa ttggtctagg tgattttaat cactatacca attgagatgg 7800 
gctagtcaat gataattact agtccttttc ctttgagttg tgggtatctg taaattctgc 7860 
tagacctttg ctggaaaact tgtaaattct gctagaccct ctgtaaattc cgctagacct 7 920 
ttgtgtgttt tttttgttta tattcaagtg gttataattt atagaataaa gaaagaataa 7 980 
aaaaagataa aaagaataga tcccagccct gtgtataact cactacttta gtcagttccg 8,040 
cagtattaca aaaggatgtc gcaaacgctg tttgctcctc tacaaaacag accttaaaac 8100 
cctaaaggct taagtagcac cctcgcaagc tcgggcaaat cgctgaatat tccttttgtc 3160 
tccgaccatc aggcacctga gtcgctgtct ttttcgtgac attcagttcg ctgcgctcac 8220 
ggctctggca gtgaatgggg gtaaatggca ctacaggcgc cttttatgga ttcatgcaag 8280 
gaaactaccc ataatacaag aaaagcccgt cacgggcttc tcagggcgtt ttatggcggg 5340 
tctgctatgt ggtgctatct gactttttgc tgttcagcag ttcctgccct ctgattttcc 8400 
agtctgacca cttcggatta tcccgtgaca ggtcattcag actggctaat gcacccagta 34^0 
aggcagcggt atcatcaaca ggcttacccg tcttactgtc aac 8503 

<210> 94 
<211> 7381 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
pAN006 plasmid 

<400> 94 

ttgcggccgc ttcgaaagct gtaatataaa aaccttcttc aactaacggg gcaggttagt ~60 
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gacattagaa aaccgactgt aaaaagtaca 
tcattaggcc tatctgacaa ttcctgaata 
tcacaaacag aatgatgtac ctgtaaagat 
atgaattttc ctgctgtaat aatgggtaga 
ttaaacccag taaatgaagt ccatggaata 
ggtgttttgg gaaacaattt ccccgaacca 
tcataaaact ctttgaagtc attctttaca 
acaccatcaa aaattgtata aagtggctct 
ctattgtaac cagttctaaa agctgtattt 
aatgcagggt aaaatttata tccttcttgt 
atttctgtgg ttatactaaa agtcgtttgt 
1 ctcttccaat tgtctaaatc aattttatta 
atctaaagtg aatttaggag gcttacttgt 
taaaagtcaa tattactgta acataaatat 
tttcgtttgt tgaactaatg ggtgctttag 
ggtcttttgt gtttttttaa aggatttgag 
tgataataag ggtaactatt gaattcggta 
tccgtcagga tggccttctg cttaatttga 
cgccaccctc cgggccgttg cttcgcaacg 
ctcaggagag cgttcaccga caaacaacag 
gcctttcgtt ttatttgatg cctggcagtt 
accatcggcg ctacggcgtt tcacttctga 
gctactgccg ccaggcaaat tctgttttat 
tatcaggctg aaaatcttct ctcatccgcc 
aaaccgtttt actctcaaaa tcttaaaaga 
caaaattgta cgggctggtt cgttccccag 
aacagccact ttcggctcat ggcttgccgc 
aataatgacc ttatctcctt cctgcacaag 
gccgcttccc cgtttaccag gaataatata 
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gtcggcatta tctcatatta taaaagccag 120 
gagttcataa acaatcctgc atgataacca 180 
agcggtaaat atattgaatt acctttatta 240 
aggtaattac tattattatt gatatttaag 300 
atagaaagag aaaaagcatt ittcaggtata 360 
ttatatttct ctacatcaga aaggtataaa 420 
ggagtccaaa taccagagaa tgttttagat 480 
aacttatccc aataacctaa ctctccgtcg 540 
gagtttatca cccttgtcac taagaaaata 600 
tttatgtttc ggtataaaac actaatatca 660 

tggttcaaat aatgattaaa tatctctttt 720 

i 

aagttcattt gatatgcctc ctaaattttt 780 
ctgctttctt cattagaatc aatccttttt 840 
atattttaaa aatatcccac tttatccaat 900 
ttgaagaata aagaccacat taaaaaatgt 960 
cgtagcgaaa aatccttttc tttcttatct 1020 
ccaagagttt gtagaaacgc aaaaaggcca 1080 
tgcctggcag tttatggcgg gcgtcctgcc 1140 
ttcaaatccg ctcccggcgg atttgtccta ' 1200 
ataaaacgaa aggcccagtc tttcgactga 1260 
ccctactctc gcatggggag accccacact 1320 
gttcggcatg gggtcaggtg ggaccaccgc 1380 
cagaccgctt ctgcgttctg atttaatctg 1440 
aaaacaggat cctacggaaa tggagcggca 1500 
aaacccccga taaagggggc ttttcttcta 1560 
catttgttca attttgtttt gatcattcag 1620 
ttcttgatca gacatcattt tgtaggaaat 1680 
gcgtgcggct gcaccgttta agcatatgac 1740 
cgtttcaaga cgtgctccat tattattatt 1800 
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cacaatttgt actttttcat taggaagcat tcccacagca tcaatgagat *ctt<:atcaat 1860 
tgtaatgctt cccacatagt bcaggtttgc ttccgtaaca gttgccctgt ^aagtttgcc 1920 
'gctcatcatt gttcgataca tattatattc tctccatttc tcgaatatca ataatgatat 1980 
tatctattaa acgcgctttt gaaaaagcaa ctgcaacagc gagaatcatc tttccagcaa 2040 
tttcattcac aggctcgagt tccggatagg aataaagctc tacatagt-ct atggttccgc 2100 
tagtcgtttc aatgatatct tttgcagctt ttatcaccgc tbcaggafcct <±ttcaccgg 2160 
cttggacaag ttccgcactt gtttgaaggg cccgatacag cttaggcgct tcttttcttt 2220 
cctcagctgt taagtataca ttgcgagagc ttttggctaa ^ccgtcttcc tctctgaccg 2280 
tatcgacagg aaccaattca atatccatga agaagtegct gattaaccca tcaacaacag 2340 
ctacctgctg cgcatctttt aaaccgaaat aggcacgagt cggcttgact agattgaaaa 2400 
gcttcgtcag tacgatcgcg accccgtcaa aatgtccttc tcttgagcgc ccgcataaca 2460 
cgtctgtgcg tctttctaca tgaatcgtga cattcttttc accgggatac atatcatgag 2520 
catctggcgt aaaaagaata tcgactccgg cgttttctgc aagagctgca tcccgcbcaa 2580 
tatcgcgcgg atatgcttca aaatcttcat tagggccgaa ttgtgeagga ttcacaaaaa- 2640 
tactcataat aacggcgtcg ttttcttgtc ttgctttgtc tgctaaggtt aaatgcccct 2700 
catgcagaaa ccccatcgtc ggaacaaatc cgattgactt gccctctgaa tggtattgtt: 2760 
ttatggcttc tttcagctgt gaaatatcag taatx:tgtct catcttattt tcccccgtac 2820 
aagccgtcaa gcactgtctg gttcatttga aaggaatgct tttgttcagg gaaagcacga 2880 
tgtcttacat cctgaacata tccgctgatt gctgtttcga tggtttcatc aatgcgcgta 2940 
tattgcttta caaatttagg tgttctctea acaccgtggc cgataatabc atgataaacg 3000 
. agaacttgtc cgtccgcttt cacaccagcc ocgattccaa tgaccggtat gcttagcgtc 3060 
tcggcaattt tggctgtgag ttctgccggc acacattcca gcacaagcat catagctcct 3120 
gcttcttcgc attttatact gtcttetatt aattttttgg -cgctttgttc gtcttt^ccc 3180 
tgtactttat agccgcccag tacgccgact gactgcggtg tcaaacctaa gtgacfcgact 3240 
actggaatgc ctccaagcgt caatgcgcga atggattcaa acacgccttc tccgccctca 3300 
agcttcagtg cgtcagctcc gctttcctga acgatagccg ctgcattttt cagcgtatct 3360 
tccttagaca ggtgataaga cataaacggc atatctgtca oaataaaggt attcggcgca 3420 
ccccttttaa cggcttttgt atgatggatc atgtccgcaa ctgtcacacc gacagttgaa 3480 
tcaaggccga ggacgaccat tccaagtgaa tcaccgacta aaatcatgtc aactcccgct 3540 
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tgttcagcaa 


gtttagctgc 


cggataatca 


gactccttca 


tttttagaaa 


atccagtttt 


cctgctgttg 


ttaagattat 


tataccacac 


aatttttcag 


gaattttagc 


agaggttgtt 


ttgtgctgtt 


aggatatctt 


tcttggaagc 


aaaaggaaac 


aaaaagaccg 


ttttcacaca 


cacagccact 


tttgcaaaaa 


ccggacagct 


ctgcaggcat 


gcaagcttcg 


cgaagcggcc 


tatgattctc 


ctcgcttccg 


gcggcatcgg 


gcaggtagat 


gacgaccatc 


agggacagct 


aacttcgatc 


actggaccgc 


tgatcgtcac 


gaacgggttg 


gcatggattg 


taggcgccgc 


tcgcggtgca 


tggagccggg 


ccacctcgac 


ggattcacca 


ctccaagaat 


tggagccaat 


accaaccctt 


ggcagaacat 


atccatcgcg 


atctcgggca 


gcgttgggtc 


ctggccacgg 


acccggctag 


gctggcgggg 


ttgccttact 


cgaacgtgaa 


gcgactgctg 


ctgcaaaacg 


ttcggtttcc 


gtgtttcgta 


aagtctggaa 


ttccggatct 


gcatcgcagg 


atgctgctgg 


acgaagcgct 


ggcattgacc 


ctgagtgatt 


agttgtttac 


cctcacaacg 


ttccagtaac 


gtgagcatcc 


tctctcgttt 


catcggtatc 


cacggaggca 


tcaagtgacc 


aaacaggaaa 


gaagccagac 


attaacgctt 


ctggagaaac 


acatctgtga 


atcgcttcac 


gaccacgctg 


tcggtgatga 


cggtgaaaac 


ctctgacaca 


tgtaagcgga 


tgccgggagc 


agacaagccc 


gtcggggcgc 


agccatgacc 


cagtcacgta 
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taagcggtca gcatgacaat cggttcttca' 3600 
gttttcatgt tttctcctcc tctagagcgt 3660 
cttgtagata aagtcaacaa ctttttgcaa 3720 
ctggatgtag aacaaaacat ctttccgctc 3780 
taggtaggcc tcgagttatg gcagttggtt 3840 
aaacggtctt tttcgatttc. tttttacagt 3900 
tcatgcctta taactgctgt ttcggtcgac 3960 
gccgacgcga ggctggatgg ccttccccat 4020 
gatgcccgcg ttgcaggcca tgctgtccag 4080 
tcaaggatcg ctcgcg^ctc ttaccagcct 4140 
ggcgatttat gccgcctcgg cgagcacatg 4200 
cctatacctt gtctgcctcc ccgcgttgcg 4260 
ctgaatggaa gccggcggca cctcgctaac. 4320 
caattcttgc ggagaactgt gaatgcgcaa 4380 
tccgccatct ccagcagccg cacgcggcgc 4440 
gtgcgcatga tcgtgctcct gtcgttgagg;. 4500 
ggttagcaga atgaatcacc gatacgcgag 4560 
tctgcgacct gagcaacaac atgaatggtc 4620 
acgcggaagt cagcgccctg caccattatg 4 680 
ctaccctgtg gaacacctac atctgtatta 4740 
tttctctggt cccgccgcat ccataccgcc 4800 
cgggcatgtt catcatcagt aacccgtatc 4860 
attaccccca tgaacagaaa ttccccctta 4 920 
aaaccgccct taacatggcc cgctttatca 4 980 
tcaacgagct ggacgcggat gaacaggcag 5040 
atgagcttta ccgcagctgc ctcgcgcgtt 5100 
tgcagctccc ggagacggtc acagcttgtc 5160 
gtcagggcgc gtcagcgggt gttggcgggt 5220 
gcgatagcgg agtgtatact ggcttaacta 5280 
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tgcggeatca gagcagattg tactgagagt gcaecatatg cggtgtgaaa taccgcacag 5340 
atgcgtaagg agaaaatacc gcatcaggcg ctcttccget tcctcgctca -ctgact-cgct 5400 
gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 5460 ( 
atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 5520 • 
caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 5580' 
gcatcacaaa aatcgacgct caagtcagag gtggcgaaac -ccgacaggac tataaagata 5640 
ccaggcgttt ccccctggaa gctccctcgt gcgctctocct gttccgaccc tgccgcttac 5700 
cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 5760 
taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 5820 
cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 5880 
( acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag -cgaggtatgt 5940 
aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt €000 
atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 6060 
atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 6120 
gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 'ctgacgctea €1*0 
gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac; €240 
ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac -6300 
ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtotatt 6360 
tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt 6420 
accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ct-ccagattt 6480 
atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggt<sctg caactttatc 6540 
cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa -6600 
tagtttgcgc aacgttgttg ccattgctgc aggcatcgtg gtgtcacgct cgtcgtttgg -6660 
tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 6720 
gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 67SO 
agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt €840 
aagatgcttt tctgtgactg gtgagtactc aaocaagtca ttctgagaat agtgtatgcg 6900 
gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac 6960 
tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 7020 
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gctgttgaga tccagttcga tgtaacccac 
tactttcacc agcgtttctg ggtgagcaaa 
aataagggcg acacggaaat gttgaatact 
catttatcag ggttattgtc tcatgagcgg 
acaaataggg gttccgcgca catttccccg 
tattatcatg acattaacct ataaaaatag 
a 
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tcgtgcaccc aactgatctt cagcatcttt 7080 
aacaggaagg caaaatgccg caaaaaaggg 7140 
catactcttc ctttttcaat attattgaag 7200 
atacatattt gaatgtattt agaaaaataa 7260 
aaaagtgcca cctgacgtct aagaaaccat 7320 
gcgtatcacg aggccctttc gtcttcaaga 7380 

7381 
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This International Searching Authority found multiple {groups of) 
inventions in this international application, as follows: 

1. Claims: 1-6,12, 13,24,26-28, 33-35,48,51,54-64, 71, 76,78-86, 
92/93,95,97-160,102,103 {all partially) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., in a manner 
independent of precursor feed, comprising ciilturing a 
microorganism {e.g., Gram-positive {e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram- negative) which overexpresses a 
ketopantoate hydroxymethyltransferase^encoding gene, e.g. 
the panB gene, e.g., from Bacillus, under conditions such 
that said panto-compound is produced, ami possibly further 
recovering the compound. A recombinant microorganism (e.g., 
Gram-positive {e.g., belonging to the genus Bacillus {e.g., 
Bacillus subtil is), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) which overexpresses a Bacillus 
( subti lis) ketopantoate hydroxymethyUransf erase-encodi ng 
gene. A recombinant vector encoding a Bacillus (subtilis) 
ketopantoate hydroxymethy Itransferase -encoding gene operably 
linked to regulatory sequences, e.g., comprising a nucleic 
acid sequence according to SEQ ID N0:23 or part of SEQ ID 
M0:59. An isolated nucleic acid molecule encoding a Bacillus 
(subtilis) ketopantoate hydroxymethy Itransferase, and -said 
isolated ketopantoate hydroxymethy Itransferase polypeptide. 

2. Claims: 1-6, 12, 13, 24,26-28, 33-35, 48,51,54-64, 71, 76,78-86, 

92,93,95,97-100,102,103 (all partially) 



A method of producing panthotenate {e.g. , 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., in a manner 
independent of precursor feed, comprising cuKuring a 
microorganism (e.g., Gram-positive {e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses a 
pantothenate synthetase-encoding gene, e.g. the panC gene, 
e.g., from Bacillus, under conditions such that said 
panto- compound is produced, and possibly further recovering 
the compound. A recombinant microorganism {e.g.. 
Gram-positive {e.g., belonging to the genus Bacillus (e.g., 
Bacillus subtilis), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) which overexpresses a Bacillus 
(subtilis) pantothenate synthetase-encoding gene. A 
recombinant vector encodi ng a Bacillus (subtilis) 
pantothenate synthetase-ervcodi ng gene operably linked to 
regulatory sequences, e.g., comprising a nucleic acid 
sequence according to SEQ ID N0:25 or part of SEQ 10 N0:59. 
An isolated nucleic acid molecule encoding a Bacillus 
(subtilis) pantothenate synthetase, and said isolated 
pantothenate synthetase polypeptide. 
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3. Claims: 1-6,12-14,24,26-28,33-35,48,49,51,54-64,66,71,76, 

78-86,92,93,95,97-100,102,103 (all partially): 15, 
17,19,23,32,106,107 (both completely) 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., independent of 
aspartate or beta-alanine feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, e.g., from 
Bacillus, e.g., the aspartate-alpha-decarboxylase-encoding 
panD gene from Bacillus subtil is, under conditions such that 
said panto-compound is produced, and possibly further 
recovering the compound. A recombinant microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus (e.g., 
Bacillus subtil is), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) which overexpresses a Bacillus 
(subtil is) aspartate-alpha-decarboxylase-encoding gene. A 
recombinant vector encoding a Bacillus (subtil is) 
aspartate-alpha-decarboxylase-encoding gene operably linked 
to regulatory sequences, e.g., comprising a nucleic acid 
sequence according to SEQ ID N0:27 or part of SEQ 10 NO: 59. 
An isolated nucleic acid molecule encoding a Bacillus 
(subtil is) aspartate-alpha-decarboxylase, and said isolated 
aspartate-al pha-decarboxyl ase polypepti de. 

4. Claims: 1-6,24,26-28,33-35,48,49,51,54-64,71,76,78-87,92, 

93,95,97-100,102,103 (all partially); 7-11,65,101, 
104,105 (all completely) * 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., independent of 
precursor feed, comprising culturing a microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus, 
Corynebacterium, Lactobacillus, Lactococci or Streptomyces) 
or Gram-negative) which overexpresses a ketopantoate 
reductase-encoding gene, e.g., from Bacillus, e.g., the 
ketopantoate reductase-encoding panEl gene from Bacillus 
subtil is, under conditions such that said panto-compound is 
produced, and possibly further recovering the compound. A 
recombinant microorganism (e.g., Gram-positive (e.g., 
belonging to the genus Bacillus (e.g., Bacillus subtil is), 
Corynebacterium, Lactobacillus, Lactococci or Streptomyces)) 
which overexpresses a Bacillus (subtil is) ketopantoate 
reductase-encoding gene. A recombinant vector encoding a 
Bacillus (subtil is) ketopantoate reductase-encoding gene 
operably linked to regulatory sequences, e.g., comprising a 
nucleic acid sequence according to SEQ ID NO: 29. An isolated 
nucleic acid molecule encoding a Bacillus (subtilis) 
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ketopantoate reductase, and -said isolated ketopantoate 
reductase polypeptide. 

5. Claims: 14,16,18,28,48,54-61,66,77-82, ■ 
97 (all partially); 20,29 Completely) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of valine or 
alpha-ketoisovalerate feed, comprising culturing a 
microorganism {e.g., Gram-positive {e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative ) having a deregulated 
i sol eucine- valine (ilv) pathway, wherein the microorganism 
overexpresses aceiohydroxyacid synthase or is transformed 
with a vector comprising an ilvBM nucleic acid sequence or 
an alsS sequence, e.g., f rom £aci 1 1 us , under conditions such 
that pantothenate is produced, and possibly further 
recovering the pantothenate. Said microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus (e.g., 
Bacillus subtil is), torynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) , and said vector. 



6. Claims: 14,16,18,28,48,54-61,66,77-82, 

97 (all partially); -21, 30 (completely) 



A method of producing pantothenate {e.g., 2 g/L up to 40 g/L 

at least), e.g., in a manner independent of valine or. 

alpha-ketoisovalerate feed, comprising culturing a 
microorganism (e.g., Gram-positive {e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) having a deregulated 
isoleucine- valine {ilv) pathway, wherein the microorganism 
overexpresses acetohydroxyacid isomeroreductase or is 
transformed with a vector comprising an ilvC nucleic acid 
sequence, e.g., from Bacillus, under -conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. Said microorganism (e.g., Gram-positive 
(e.g., belonging to the genus Bacillus (e.g., Bacillus 
subtil is), Corynebacterium, Lactobacillus, Lactococci or 
Streptomyces)), and said vector. 



7. Claims: 14, 16, 18,28,48,54-61/66,77-82,97 {partially); 22, 
31 (completely) 



A method of producing pantothenate {e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of valine or 
alpha-ketoisovalerate feed, comprising culturing a 
microorganism (e.g., Gram-positive fc.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
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or Streptomyces) or Gram-negative) having a deregulated 
isoleucine-valine (ilv) pathway, wherein the microorganism 
oyerexpresses dihydroxyacid dehydratase or is transformed 
with a vector comprising an ilvD nucleic acid sequence 
e.g., from Bacillus, under conditions such that pantothenate 
is produced, and possibly further recovering the Ht " ,LUU,enate 
pantothenate. Said microorganism (e.g., Gram-positive (e o 
belonging to the genus Bacillus (e.g., Bacillus subtilis) 
Corynebactenum, Lactobacillus, Lactococci or :^uut1lls;, 
Streptomyces)), and said vector. 

8. Claims: 25,28,50,54-61 (all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 q/L 
at 1 east J, e.g., in a manner independent of precursor feed 
comprising cu turmg a microorganism (e.g., Gram-positive 
(e.g be onging to the genus Bacillus, Corynebacteri urn, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negative) 
having a mutant avtA gene under conditions such that 

the JaiJtSthlnate^ 0 ' 1 " 06 ^ and possibl y further recovering 

9. Claims: 25,28,50,54-61 (all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 q/L 
at least), e.g., in a manner independent of precursor feed 
comprising cultunng a microorganism (e.g., Gram-positive ' 
(e.g., be onging to the genus Bacillus, Corynebacteri urn, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negative) 
having a mutant ilvE gene under conditions such that J " 
pantothenate is produced, and possibly further recoverinq 
the pantothenate. 3 

10. Claims: 25,28,50,54-61 (all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of precursor feed 
comprising cultunng a microorganism {e.g., Gram-positive ' 

ttkL • ?" 9in ? *2 the ? enus Baci H"s, Corynebacteri urn, 
Lactobacillus Lactococci or Streptomyces) or Gram-negative) 
having a mutant ansB gene under conditions such that 
?Stae N " Ced ' and possibly further recovering 

11. Claims: 25,28,50,54-61 (all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 q/L 
at least), e.g., in a manner independent of precursor feed 
comprising cultunng a microorganism {e.g., Gram-positive 
(e.g., be onging to the genus Bacillus, Corynebacteri urn, 
Lactobacillus, Lactococci or Stre ptomyces) or €ram-negative) 
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having a mutant also gene under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. 



12. Claims: 36,37,54-59,61 {all partially) 

A method of producing teta-alanine, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressing microorganism has 
a mutation in a nucleic acid sequence encoding ketopantoate 
hydroxymethyl transferase (panB), and possibly further 
recovering the compound. 



13. Claims: 36,37,54-59,61 (all partially) 

A method of producing beta-alanine, comprising culturing a 
microorganism (e.g., *Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressing microorganism has 
a mutation in a nucleic acid sequence encoding ketopantoate 
reductase (panE), and possibly further recovering the 
compound. . 



14. Claims: 36,37,54-59,61 (all partially) 

A method of producing beta-alanine, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-^ncoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressing microorganism has 
a mutation in a nucleic acid sequence encoding pantothenate 
synthetase (panC) , and possibly further recovering the 
compound. 



15. Claim : 38 (completely) 

A method of producing beta-alanine comprising contacting a 
composition comprising aspartate with an isolated Bacillus 
aspartate-alpha-decarboxylase enzyme under conditions such 
that -beta-alanine is produced. 
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16. Claims: 41,44-47,51,53,54-61,69,71,72,75,78-81, 

?L ( ?H Dar * ia11 y); 39,43,52,67,70,74,88-91, 

108-110 (all completely) 

A method for producing or for enhancing production of 
ketopantoate, pantoate, or pantothenate (e.g., 10, 20 or 40 
g/L at least), comprising culturing a mutant microorganism 
(e.g., Gram-positive (e.g., belonging to the genus Bacillus. 
Corynebacterium, Lactobacillus, Lactococci or Streptomyces) 
or Gram-negative) having a mutant pantothenate 
kinase-encoding coaX gene, under conditions such that said 
panto-compound is produced or that production is enhanced 
and possibly further recovering the compound. A method for 
identifying compounds which modulate pantothenate kinase 
activity comprising contacting a recombinant cell exoressina 
the coaX gene, possibly further comprising a mutant coaA 
gene encoding a pantothenate kinase with reduced activity 
with a test compound and determining the ability of the test 
compound to modulate pantothenate kinase activity in said 
cell. A recombinant microorganism having a mutant coaX qene 
encoding a pantothenate kinase with reduced activity A 
vector comprising a mutant coaX gene encoding a pantothenate 
kinase with reduced activity, possibly further comprisinq 
regulatory sequences. A recombinant microorganism comprisinq 
a vector comprising an isolated coaX gene (e.g., from 9 
Bacillus (subtil is)), and said vector, possibly further 
comprising regulatory sequences, e.g., a constitutivelv 
active promoter. A recombinant microorganism (e.g 
Gram-positive (e.g., belonging to the genus Bacillus (e.g 
Bacillus subtil is), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) that overproduces a 
panto-compound having a mutation in a coaX gene that results 1 
in a reduced level of pantothenate kinase activity, 
resulting in a decrease in the capacity of the microorganism 
to synthesize coenzyme A. An isolated nucleic acid molecule 
comprising a (mutant) coaX gene, and an isolated 
pantothenate kinase protein encoded by a coaX gene. 

17. Claims: 41,44-47,51,53-61,69,71,72,75,78-81, 

97 (all partially); 40,42,68,73 (all completely) 

A method for producing or for enhancing production of a 
panto-compound, e g. ketopantoate, pantoate or pantothenate 
(e.g., 10, 20 or 40 g/L at least), comprising culturinq a 
mutant microorganism (e.g., Gram-positive (e.g., belonging 
to the genus Bacillus, Corynebacterium, Lactobacillus 
Lactococci or Streptomyces) or Gram-negative) having a 
mutant pantothenate kinase-encoding coaA gene, under 
conditions such that the panto- compound is produced or that 
production is enhanced, and possibly further recovering the 
panto-compound. A recombinant microorganism having a mutant 
coaA gene encoding a pantothenate kinase with reduced 
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activity. A recombinant microorganism (e.g., Gram-positive 
{e.g., belonging to the genus Bacillus (e.g., Bacillus 
subtil is), Corynebacterium, Lactobacillus, Lactococci or 
Streptomyces)) that overproduces a panto-compound having a 
mutation in a coaA gene that results in a reduced level of 
pantothenate kinase activity, resulting in a decrease in the 
capacity of the microorganism to synthesize coen?yme A. A 
vector containing a (mutated) coaA gene. 




18. 


Claim : 94 (partially) 

A vector containing regulatory sequences comprising the 
constitutively active promoter Pveg (SEX) 10 NO:41). 




19. 


Claim : 94 (partially) 

A vector containing regulatory sequences comprising the 
constitutively active promoter P15 (SEQ ID NO: 39). 




20. 


Claim : 94 (partially) 

A vector containing regulatory sequences comprising the 
constitutively active promoter P26 {SEQ ID N0:40). 




21. 


Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID NO:49. 




22. 


Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:50. 




23. 


Claim : 96 (partially) 

A vector containing regul atory sequences comprising an 
artificial RBS according *o SEQ ID N0:51. 




24. 


Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID NO: 52. 




25. 


Claim : 96 {partially) 

A vector containing regulatory sequences comprisifKj an 
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artificial RBS according to SEQ ID NO: 53. 

26. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID NO: 54. 

27. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID NO: 55. 

28. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID NO: 56. 

29. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID NO: 57. 
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